PREFACE

The aim of this book is to present fundamental concepts in particle physics. This includes topics such as the 
theories of quantum electrodynamics, quantum chromodynamics, weak interactions, Feynman diagrams and 
Feynman rules, important conservation laws and symmetries pertaining to particle dynamics, relativistic field 
theories, gauge theories, and more. In addition to explaining the underlying theories in a detailed manner, we shall 
also provide a number of examples that will illustrate the formalisms "in action". 

This book is primarily based on my lecture notes from teaching this class to university students over several 
years, and the notes are in turn based on the excellent book "Introduction to Elementary Particles" by D. Griffiths, 
which they follow closely in structure and choice of examples. I have also included some additional topics and 
instructive examples which hopefully will allow the reader to obtain a more thorough physical understanding of 
the material. This book is suitable as material for a full-semester course in introductory particle physics. 

It is my goal that students who study this book will afterwards find themselves well prepared to dig deeper into
the remarkable world of theoretical physics at a more advanced level. I welcome feedback on the book (including 
any typos that you may find, although I have endeavored to eliminate as many of them as possible) and hope that 
you will have an exciting time reading it! 


Jacob Linder (jacob.linder@ntnu.no) 

Norwegian University of Science and Technology 
Trondheim, Norway


I. OVERVIEW OF ELEMENTARY PARTICLES AND THEIR INTERACTIONS

Learning goals. After reading this chapter, the student should: 

• Be able to describe the four fundamental interactions, their mediating particles, and which particles can
participate in a given interaction.

• Understand how to read basic Feynman diagrams. 

• Account qualitatively and quantitatively for neutrino oscillations. 


In the so-called Standard Model of particle physics, matter is believed to consist of elementary particles. One often 
makes the distinction between matter and mediators (also known as force fields), so that although the following 
three groups are all elementary particles: 

• Leptons 

• Quarks 

• Mediators 

only the leptons and quarks (collectively known as fermions) comprise matter. In this book, you will get 
acquainted with all of these particles and how they interact with each other. We will begin by introducing the 
fundamental forces by which elementary particles interact and represent their interaction via Feynman diagrams. 
Our treatment will mostly be qualitative to begin with in order for the reader to get an overview of the situation, 
and then we will treat each scenario in detail mathematically. The appropriate physical theory to be used when 
considering physical phenomena in a system depends on which length scales and velocities we are considering. 
The length scale should be compared with the de Broglie wavelength $\lambda$ of the particles, and the velocity $v$ of the
particles with the speed of light $c$, as illustrated in the figure below. In particular, $\lambda$ roughly marks the transition for the length scales where
a quantum mechanical description is required. Similarly, if a particle is moving with a velocity close to the speed
of light $c$, a relativistic description is needed. In this way, region III describes length scales which are small compared to or
comparable to $\lambda$ and velocities that are slow compared to $c$, whereas region IV describes length scales that are
equally small, but where the velocity is comparable to $c$.

[Figure: regimes of the different physical theories as a function of length scale and velocity.]



A. The fundamental interactions 

Each of the four fundamental interactions is believed to be mediated by the exchange of a particle: a so-called
mediator. This is well-established except for the gravitational force, as there is no compelling evidence as of today
for the existence of a graviton.

Force             Theory                          Mediator

Strong            Chromodynamics                  Gluon

Electromagnetic   Electrodynamics                 Photon

Weak              Flavordynamics                  $W^\pm$ and $Z^0$

Gravitational     General theory of relativity    Graviton?

We will start by considering quantum electrodynamics (QED), which is the oldest, simplest, and most successful 
(in terms of comparison between theory and experiment) of all the above theories. The others are to a large extent 
in fact modelled on QED. 


B. Quantum electrodynamics 

All electromagnetic phenomena can ultimately be reduced to the following elementary process: 



The diagram should be read as: a charge $e$ enters, emits/absorbs a photon $\gamma$, and exits. The charged particle can
be a lepton or a quark. For more complicated processes, one patches together several such so-called primitive
vertices. For instance, the interaction between two electrons (Møller scattering) can be drawn as:



Diagrams of this type are known as Feynman diagrams. Note that a particle running backward in time (as indicated 
by the arrows) must be interpreted as the corresponding antiparticle moving forward in time. In fact, a universal 
feature of quantum field theory is: 


For every kind of particle, there exists an antiparticle with equal mass and opposite charge. 


There are six so-called flavors of leptons and these flavors form three generations: $e^-$ and $\nu_e$, $\mu^-$ and $\nu_\mu$, and $\tau^-$
and $\nu_\tau$. Here, $\mu^-$ is a muon, $\tau^-$ is a tau particle, while $\nu_j$ are neutrinos of a particular type $j \in \{e, \mu, \tau\}$.
Notationwise, antiparticles are denoted with an overbar or by specifying their charge explicitly: $n$ for neutron,
$\bar{n}$ for antineutron and $e^-$ for electron, $e^+$ for positron. Note that since the neutron is not electrically charged, one
could ask how its antiparticle is physically different. As we shall see, it turns out that particles such as neutrons
carry other quantum numbers that also change sign for antiparticles.

In the above figure, an electron and positron annihilate to form a photon, which then produces an $e^-$-$e^+$ pair. The
photon is its own antiparticle. The diagram then represents the interaction between two opposite charges, namely
their Coulomb attraction. Another diagram also contributes to this process:



Diagrams with two vertices can also be used to represent several other processes: 

Pair annihilation                    Pair production                      Compton scattering

$e^- + e^+ \to \gamma + \gamma$      $\gamma + \gamma \to e^- + e^+$      $e^- + \gamma \to e^- + \gamma$



There is no need to explicitly specify the + or - charge on the external lines: the arrows suffice to indicate whether
it is a particle or antiparticle. Note that all of the above processes are related via a so-called crossing symmetry.
Suppose that $A + B \to C + D$ is allowed. Then:

$A \to \bar{B} + C + D, \quad A + \bar{C} \to \bar{B} + D, \quad \bar{C} + \bar{D} \to \bar{A} + \bar{B}$ (1.1)

are in principle also allowed. However, an important caveat is that conservation of energy must be fulfilled. For
instance, if the masses satisfy $m_A < m_B + m_C + m_D$, then $A \to \bar{B} + C + D$ cannot occur. The reason is energy
conservation. A massive particle has rest energy (we will have more to say about this in our treatment of the
special theory of relativity later on) and thus $m_A$ needs to be sufficiently large in order to enable the reaction. This
is an example of a process which is not kinematically permissible, meaning that it does not satisfy conservation of,
for instance, energy or momentum.
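
The mass condition above can be checked numerically. The following is a minimal sketch: the function name `decay_is_kinematically_allowed` and the rounded mass values (in MeV/c^2) are illustrative assumptions and are not taken from the text. The check is necessary but not sufficient, since other conservation laws may still forbid a decay.

```python
def decay_is_kinematically_allowed(parent_mass, product_masses):
    """Return True if the rest energy of the parent can cover the rest
    energies of all decay products (a necessary condition only)."""
    return parent_mass >= sum(product_masses)


# Approximate masses in MeV/c^2 (rounded, for illustration only).
M_NEUTRON = 939.57
M_PROTON = 938.27
M_ELECTRON = 0.511
M_NEUTRINO = 0.0  # neglected in this sketch

# n -> p + e + antineutrino: the neutron is heavy enough.
neutron_decay_ok = decay_is_kinematically_allowed(
    M_NEUTRON, [M_PROTON, M_ELECTRON, M_NEUTRINO]
)

# p -> n + positron + neutrino: a free proton is too light for this.
proton_decay_ok = decay_is_kinematically_allowed(
    M_PROTON, [M_NEUTRON, M_ELECTRON, M_NEUTRINO]
)

print(neutron_decay_ok, proton_decay_ok)
```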

In terms of Feynman diagrams, processes which are related via crossing symmetry can be obtained by twisting or 
rotating the diagrams. We find additional possibilities if we allow for four vertices:

The common feature of all these diagrams is that two electrons enter and exit. The internal lines represent virtual
particles that cannot be observed: they describe the precise mechanism of the interaction. It should be emphasized
that diagrams are symbolic in the sense that they do not represent actual particle trajectories. Each Feynman
diagram is actually associated with a particular complex number $A_i$ that may be computed by using Feynman rules.
We shall say much more about this later when we develop our quantitative theory of particle interactions. Assume 
now that you want to analyze a particular physical process. The standard procedure is then: 

• Draw all Feynman diagrams contributing to this process (with appropriate external lines) that have 2 vertices, 
then all with 4 vertices, and so on. 

• Evaluate the contribution of each diagram via Feynman rules. 

• Sum all these contributions: the result represents the actual physical process. 

One aspect of the above recipe seems problematic: there are infinitely many Feynman diagrams, since we
can construct higher and higher order contributions by introducing additional vertices! We cannot consider
all of these. The solution is that each vertex in a diagram introduces a factor $\alpha = e^2/\hbar c \approx 1/137$: the
fine structure constant. Therefore, higher order diagrams fortunately contribute less and less the more vertices
they have. In practical calculations, it is often sufficient to include diagrams with up to four vertices. Feynman
rules include conservation of energy and momentum at each vertex (and thus automatically for the whole diagram).
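
To get a feeling for how quickly higher-order diagrams are suppressed, the short sketch below simply takes the qualitative counting rule stated above at face value (one factor of the fine structure constant per vertex). The function name `suppression_factor` is a hypothetical helper, and the numbers are rough weights rather than actual amplitudes.

```python
ALPHA = 1 / 137  # approximate fine structure constant


def suppression_factor(n_vertices):
    """Rough weight ALPHA**n_vertices of a diagram, following the
    qualitative rule of one factor of alpha per vertex."""
    return ALPHA ** n_vertices


for n in (2, 4, 6):
    print(n, suppression_factor(n))

# Adding two more vertices costs another factor of alpha squared.
ratio = suppression_factor(4) / suppression_factor(2)
print(ratio)
```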

The primitive QED vertex thus cannot represent a physical process: $e^- \to e^- + \gamma$ violates energy conservation
since the electron does not have enough of the aforementioned rest energy to sustain the existence of both an
electron and an additional photon $\gamma$. The same problem exists for $e^- + e^+ \to \gamma$ where momentum conservation
cannot be fulfilled: in the CM system, $e^-$ and $e^+$ enter with equal and opposite velocities so that $p_\text{tot} = 0$. However,
the final momentum cannot be zero since the photon $\gamma$ travels at the speed of light. It is important to note that
virtual particles (internal lines in the diagrams) do not necessarily satisfy the same relation between energy, mass,
and momentum as the corresponding physical particle. Instead, their mass is whatever the conservation laws at
each vertex require. In contrast, external lines represent real particles and must have the correct mass.



C. Quantum chromodynamics 

In quantum chromodynamics (QCD), color plays the role of charge in QED. The fundamental process (primitive
vertex) is quark $\to$ quark + gluon. There are six types of quarks, also referred to as flavors: up, down, strange,
charm, top, and bottom ($u$, $d$, $s$, $c$, $t$, $b$).



Leptons have no color and thus do not interact strongly. The force between quarks (responsible e.g. for binding 
quarks to make baryons) has the lowest-order diagram: 



Note how the gluon is represented by a curly line whereas the photon is represented by a wavy line. A key
difference from QED is that there are three kinds of color (instead of $\pm$ charge) and color may change in a process,
although the total color must be conserved. In the example shown below, the gluon carries away one unit of blue
and minus one unit of red:



Note that the flavor of the quark does not change. A negative unit of color is denoted by an overbar, so that a
negative unit of red color is $\bar{r}$. Gluons are therefore "bicolored": they carry one positive and one negative unit of
color. This suggests that $3 \times 3 = 9$ possible gluons should exist. For reasons that we will come back to, there
are nevertheless only 8. Moreover, individual quarks do not appear freely in nature, but quark-antiquark pairs
called mesons exist. For now (we will later bring out a nuance in this statement), we may write that all naturally
occurring particles are colorless. We define colorless as either the total amount of color being zero or the three
colors being present in equal amounts.
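
The working definition of "colorless" can be turned into a small bookkeeping exercise. This is only a sketch of the counting rule stated above: the labels `"r"`, `"g"`, `"b"`, the `"anti-"` prefix and the function names are assumptions made for illustration, and the nuance mentioned in the text (real hadrons are particular superpositions of color states) is ignored.

```python
def net_color(constituents):
    """Sum the color units: 'r' counts as +1 red, 'anti-r' as -1 red, etc."""
    total = {"r": 0, "g": 0, "b": 0}
    for c in constituents:
        if c.startswith("anti-"):
            total[c[len("anti-"):]] -= 1
        else:
            total[c] += 1
    return total["r"], total["g"], total["b"]


def is_colorless(constituents):
    """Colorless: total color is zero, or all three colors appear in
    equal amounts. Both cases mean the three net counts are equal."""
    r, g, b = net_color(constituents)
    return r == g == b


print(is_colorless(["r", "g", "b"]))   # baryon-like combination
print(is_colorless(["r", "anti-r"]))   # meson-like combination
print(is_colorless(["r"]))             # a single free quark
```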

Since gluons carry color (unlike photons that do not carry charge), they may couple to other gluons: 



This $g$-$g$ coupling makes QCD richer and a lot more complicated than QED. Another essential difference is
related to the size of the coupling constant. In QED, we stated that each vertex introduced a factor $\alpha \approx 1/137$.
In contrast, the coupling constant for strong forces, $\alpha_s$, is (under some circumstances) greater than 1. It may be
determined from e.g. the force between two protons. This leads to a problem: higher-order diagrams
contribute more and more! The resolution to this, as we shall discuss in more detail later, is that $\alpha_s$ is in fact not
a true constant. It rather depends on the separation distance between the interacting particles, giving rise to the
phenomenon of asymptotic freedom. For large distances, $\alpha_s$ becomes large. For small distances (less than the size
of a proton), $\alpha_s$ becomes small. Therefore, within a proton or pion, quarks move around without interacting much.

The proton and neutron each consist of three quarks, and such a composite particle is known as a baryon. A
composite particle consisting of one quark and one antiquark (a $q\bar{q}$ pair) is known as a meson. Baryons and
mesons are thus both quark-based composite particles which collectively are known as hadrons. 

This distance-dependent effective coupling actually appears in electrodynamics as well in the form of charge 
screening. 


[Figure: a positive charge $q$ in a dielectric medium, surrounded by polarized molecular dipoles.]


The "halo" of negative charge from molecular dipoles partially cancels the field from $q > 0$. Thus, the effective
charge is reduced to $q_\text{eff} = q/\epsilon$ where $\epsilon$ is the dielectric constant of the medium. It measures how easily a substance
becomes polarized by an electric field $E$.

[Figure: effective charge $q_\text{eff}$ as a function of separation.]


Now, in QED the vacuum actually acts as a dielectric due to the production of $e^-$-$e^+$ pairs that interact in the
following ways:

The key aspect to note in these diagrams is that the virtual electron in each bubble will be attracted to a positive
charge $q$, whereas the positron is repelled. As a consequence of the screening occurring even in vacuum, what
is measured in experiments is the screened effective charge. The same thing happens in QCD, but with an extra 
ingredient: gluon-gluon vertices. Therefore, in addition to diagrams of the type (a) and (b) above (where the 
photons should now be exchanged with gluons), we also have to include 




It is not obvious what the influence of these diagrams will be, but it turns out that their effect is opposite: they
drive down $\alpha_s$ at short distances, in contrast to quark polarization diagrams which enhance $\alpha_s$ at short distances.
These effects then compete. The winner of the competition is determined by comparing the relative number of
flavors (quarks) and colors (gluons). Detailed calculations beyond the scope of this textbook reveal that the critical
parameter is $a = 2f - 11n$ where $f$ is the number of flavors and $n$ is the number of colors. In the Standard Model,
$f = 6$ and $n = 3$, leading to $a = -21$. Since $a$ is negative, the coupling decreases at short distances as stated previously.
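
The arithmetic behind the critical parameter is easy to reproduce. The sketch below only evaluates the quoted expression for a given number of flavors and colors; the function name `critical_parameter` is a hypothetical choice, and the loop over flavor numbers is added purely to illustrate when the sign would flip for three colors.

```python
def critical_parameter(n_flavors, n_colors):
    """Evaluate a = 2f - 11n. A negative value means that the strong
    coupling decreases at short distances."""
    return 2 * n_flavors - 11 * n_colors


a_standard_model = critical_parameter(6, 3)
print(a_standard_model)

# With three colors, how many flavors keep a negative?
largest_f = max(f for f in range(1, 40) if critical_parameter(f, 3) < 0)
print(largest_f)
```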
Keep in mind that in contrast to charge, no naturally occurring particles carry color: quarks must be confined
to colorless packages of two (mesons) and three (baryons). As a result, strong interactions between naturally
occurring particles may be quite complicated. Consider for instance the strong force between two protons. One of
the contributing diagrams is: 



[Figure: effective $\pi^0$ exchange.]


It is worth mentioning that we here see remnants of an early model for strong interactions introduced by the 
physicist Yukawa, who conjectured that the pion (and not the gluon) was the mediator of the strong force. In 
reality, the diagram shows that the interaction is much more complicated. 


D. Weak interactions 

Unfortunately, there is no particular name for the property that produces weak forces, but whatever you call it: 
quarks and leptons have it. Two types of weak interactions exist. Charged weak interactions are mediated by $W^\pm$
bosons while neutral weak interactions are mediated by the $Z^0$. For leptons, the fundamental charged vertex is:



A negative lepton $l^-$ converts into the corresponding neutrino and emits a $W^-$ (or absorbs a $W^+$: the diagram can
mean both things). As before, we combine primitive vertices for more complicated reactions:



This type of neutrino-muon scattering event is hard to set up in the laboratory, but a slight twist gives us a muon-decay
diagram that occurs all the time:

The fundamental neutral vertex for leptons is:



Here, l is any lepton including neutrinos. Neutral interactions were difficult to discover experimentally, being 
masked by much stronger electromagnetic interactions (we will see later why the electromagnetic ones usually 
dominate): 



Therefore, to observe a purely weak interaction one has to consider neutrino scattering since there is no electromagnetic
contribution to such a process. In terms of notation for the $W^\pm$ and $Z^0$ bosons in Feynman diagrams,
it is worth remarking that another frequently encountered convention exists where these particles are drawn as wavy
lines rather than dotted lines, similarly to the photon. Here, we stick with the dotted line convention to distinguish
them more clearly from photon-mediated diagrams. Turning to how quarks interact weakly, recall first that leptonic
weak vertices connect members of the same generation: $e^-$ connects to $\nu_e$, but never $\nu_\mu$, for instance. Thus, we
have conservation of electron, muon, and tau numbers $\{L_e, L_\mu, L_\tau\}$ (see table below). Antileptons have all signs
reversed.


Lepton $l$      $Q$      $L_e$      $L_\mu$      $L_\tau$

$e^-$           -1       1          0            0

$\nu_e$          0       1          0            0

$\mu^-$         -1       0          1            0

$\nu_\mu$        0       0          1            0

$\tau^-$        -1       0          0            1

$\nu_\tau$       0       0          0            1
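
The lepton table lends itself to a simple bookkeeping check of whether a proposed leptonic reaction conserves charge and the three lepton numbers. This is a minimal sketch: the particle labels, the `"anti-"` prefix convention and the function names are assumptions introduced here for illustration.

```python
# (Q, L_e, L_mu, L_tau) for each lepton, copied from the table.
LEPTONS = {
    "e-": (-1, 1, 0, 0),
    "nu_e": (0, 1, 0, 0),
    "mu-": (-1, 0, 1, 0),
    "nu_mu": (0, 0, 1, 0),
    "tau-": (-1, 0, 0, 1),
    "nu_tau": (0, 0, 0, 1),
}


def quantum_numbers(name):
    """Antileptons are written 'anti-<name>' and have all signs reversed."""
    if name.startswith("anti-"):
        return tuple(-x for x in LEPTONS[name[len("anti-"):]])
    return LEPTONS[name]


def totals(particles):
    """Add up (Q, L_e, L_mu, L_tau) over a non-empty list of leptons."""
    return tuple(sum(col) for col in zip(*(quantum_numbers(p) for p in particles)))


def is_conserved(initial, final):
    return totals(initial) == totals(final)


# Muon decay: mu- -> e- + anti-nu_e + nu_mu
print(is_conserved(["mu-"], ["e-", "anti-nu_e", "nu_mu"]))
# Same charged particles, but with the wrong neutrino assignment
print(is_conserved(["mu-"], ["e-", "nu_e", "anti-nu_mu"]))
```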


For quarks, it would be natural to assume that the fundamental charged vertex looks as follows:

[Figure: charged weak vertex for quarks, with an emitted $W^-$.]


A quark with charge $-\frac{1}{3}$ (the $d$, $s$, $b$ quarks) converts into the corresponding quark with charge $+\frac{2}{3}$ (the $u$, $c$, $t$ quarks)
and emits a $W^-$. The outgoing quark carries the same color as the incoming one, but has a different flavor (type
of quark). Importantly, the $W^-$ does not carry the missing flavor. In fact, $W^\pm$ has no flavor since it must be able
to couple to leptons. Therefore,

Flavor is not conserved in weak interactions.

The $W^-$ in the above diagram can couple to leptons (a semileptonic process) or other quarks (a purely hadronic
process). An example of a semileptonic process is the following:

[Figure: semileptonic process mediated by a $W^-$.]




This process would never appear isolated in nature due to quark confinement, which we have discussed. However, 
if we turn it around to represent pion-decay, we obtain a feasible process as shown in (a).

The same type of diagram holds for the beta decay of the neutron, as shown in (b). Since quarks interact both
weakly and strongly, hadronic (quark-based) interactions can have both a weak and strong contribution. Consider
for instance $\Delta^0 \to p + \pi^-$:


[Figure: diagrams for the weak contribution and the strong contribution.]



The strong contribution dominates completely in magnitude. As for the fundamental neutral vertex for quarks, it 
would be natural to suppose that it looks like: 



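We may then use this to construct neutrino-scattering processes such as $\nu + p \to \nu + p$:

[Figure: neutrino-proton scattering via the neutral vertex; the proton consists of the quarks $u$, $u$, $d$.]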




In essence, the above suggests that quarks mimic the leptons as far as weak interactions are concerned. However,
we must make an important modification to what we have stated. To see why, consider the fact that if the fundamental
quark vertex may only couple quarks within the same generation [similarly to leptons, the quarks also
form three generations $(u, d)$, $(c, s)$, $(t, b)$], we cannot describe any strangeness-changing weak interactions such
as changing a strange quark into an up quark. Yet such processes do exist in nature, with two examples shown
below:

$\Lambda \to p + \pi^-$          $\Omega^- \to \Lambda + K^-$



The solution to this dilemma was suggested by Cabibbo in 1963, and we shall treat this issue quantitatively in the 
chapter on QCD later on in this textbook. For now, we note that strangeness is a property (like charge or color) 
that is assigned to each particle. Strangeness is conserved in strong interactions, but not necessarily conserved in 
weak interactions. 


Summarizing, the primitive vertices giving rise to physical processes via weak, strong, and electromagnetic
interactions are the following:

[Figure: primitive vertices for the strong, EM, and weak interactions.]



We also give a summary of how to characterize particles in terms of their spins in the table below.

                        Bosons (integer spin s)                   Fermions (half-integer spin s)

                        Spin 0                 Spin 1             Spin 1/2            Spin 3/2

Elementary particles    Higgs particle         Mediators          Quarks, leptons     n/a

Composite particles     Pseudoscalar mesons    Vector mesons      Baryon octet        Baryon decuplet


Weak and electromagnetic couplings to $W^\pm$ and $Z^0$

As a final note, just as there exist gluon-gluon couplings in QCD, the $W^\pm$ and $Z^0$ may couple to each other and
to the photon:


E. Briefly on neutrino oscillations

Neutrinos are notoriously difficult to observe, unfortunately, in particular since they are so light and electrically 
neutral. In the mid-1950s, Cowan and Reines performed an ingenious experiment where the idea was to look for a 
reaction triggered by neutrinos rather than trying to observe neutrinos as a byproduct of a reaction. In particular, 
they looked for inverse $\beta$-decay:


$\bar{\nu} + p^+ \to n + e^+$ (1.2)

and successfully confirmed the existence of neutrinos. However, there turned out to be a problem associated with 
comparison between theory and several experiments. The experiments only saw about 1/3 of the theoretically 
predicted amount of neutrinos produced by the Sun, a mystery which was dubbed the solar neutrino problem. 
Pontecorvo suggested in 1968 a bold and elegant solution to the problem: what if the $\nu_e$ produced by the Sun were
transformed into something else on their way to Earth which could not be detected by the experiments at hand,
such as $\nu_\mu$ (the experiments were specifically set up to measure $\nu_e$)? This could be achieved via so-called neutrino
oscillations.

To illustrate this idea, consider a scenario where we have two neutrino flavors $\nu_e$ and $\nu_\mu$. If indeed one may
spontaneously convert to another, neither can be an eigenfunction of the Hamiltonian. Instead, the stationary states
of the Hamiltonian describing the neutrinos should be a linear combination of them:

$\nu_1(t) = \cos\theta\,\nu_\mu(t) - \sin\theta\,\nu_e(t), \quad \nu_2(t) = \sin\theta\,\nu_\mu(t) + \cos\theta\,\nu_e(t).$ (1.3)

This form ensures orthonormality of the states $\nu_1$ and $\nu_2$, which by virtue of being stationary have a simple time
dependence:

$\nu_i(t) = \nu_i(0)\,e^{-iE_i t/\hbar}, \quad E_i = \sqrt{k^2c^2 + m_i^2c^4}.$ (1.4)

Here, $E_i$ and $m_i$ are the energy and mass of neutrino type $i$. We note immediately that it follows that $\nu_e$ and $\nu_\mu$ do
not have well-defined masses since they are linear combinations of $\nu_1$ and $\nu_2$.

To see how a flavor state evolves with time (e.g. from production in the Sun and propagating toward Earth), assume
that we start with $\nu_e$ neutrinos:

$\nu_e(t = 0) = 1, \quad \nu_\mu(t = 0) = 0.$

This gives

$\nu_1(0) = -\sin\theta, \quad \nu_2(0) = \cos\theta.$

Now, rewrite Eq. (1.3) to express $\{\nu_e, \nu_\mu\}$ as a function of $\{\nu_1, \nu_2\}$ to obtain

$\nu_\mu(t) = \cos\theta\,\nu_1(t) + \sin\theta\,\nu_2(t) = \sin\theta\cos\theta\,(-e^{-iE_1t/\hbar} + e^{-iE_2t/\hbar}).$

We then find the probability that $\nu_e$ has converted into $\nu_\mu$ after a time $t$:

$|\nu_\mu(t)|^2 = \frac{\sin^2 2\theta}{4}\,(-e^{-iE_1t/\hbar} + e^{-iE_2t/\hbar})(-e^{iE_1t/\hbar} + e^{iE_2t/\hbar})$ (1.5)

$= \frac{\sin^2 2\theta}{4}\,(1 - e^{i(E_2-E_1)t/\hbar} - e^{-i(E_2-E_1)t/\hbar} + 1)$ (1.6)

$= \frac{\sin^2 2\theta}{4}\,4\sin^2\left(\frac{E_2 - E_1}{2\hbar}t\right)$ (1.7)

$= \left[\sin 2\theta\,\sin\left(\frac{E_2 - E_1}{2\hbar}t\right)\right]^2.$ (1.8)


In this model, we have thus achieved precisely a conversion between $\nu_e$ and $\nu_\mu$ which depends in an oscillatory
manner on $t$ (or equivalently the distance travelled). There are two necessary ingredients for neutrino oscillations
to occur:

1. A finite mixing angle $\theta \neq 0$.

2. Different masses for the eigenstates $\nu_j$ (both cannot be zero).
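
The two-flavor result can be cross-checked numerically by building the $\nu_\mu$ amplitude from the mass eigenstates and comparing its squared modulus with the closed-form oscillation formula. This is a minimal sketch in natural units ($\hbar = 1$); the function names and the numerical values of the mixing angle and energies are arbitrary illustrative assumptions.

```python
import cmath
import math

HBAR = 1.0  # natural units, an assumption of this sketch


def nu_mu_amplitude(theta, e1, e2, t):
    """nu_mu(t) = cos(theta) nu_1(t) + sin(theta) nu_2(t), with the
    initial conditions nu_1(0) = -sin(theta), nu_2(0) = cos(theta)."""
    nu1 = -math.sin(theta) * cmath.exp(-1j * e1 * t / HBAR)
    nu2 = math.cos(theta) * cmath.exp(-1j * e2 * t / HBAR)
    return math.cos(theta) * nu1 + math.sin(theta) * nu2


def conversion_probability(theta, e1, e2, t):
    """Closed form: [sin(2 theta) sin((E2 - E1) t / (2 hbar))]^2."""
    return (math.sin(2 * theta) * math.sin((e2 - e1) * t / (2 * HBAR))) ** 2


theta, e1, e2 = 0.6, 1.0, 1.3  # arbitrary illustrative values
for t in (0.0, 2.0, 5.0, 11.0):
    direct = abs(nu_mu_amplitude(theta, e1, e2, t)) ** 2
    closed = conversion_probability(theta, e1, e2, t)
    print(t, direct, closed)
```

Setting `theta = 0` or `e1 == e2` in this sketch gives zero conversion probability at all times, in line with the two necessary ingredients listed above.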

Today, there is compelling experimental evidence (which was also awarded the Nobel Prize in physics in 2015)
that neutrinos are not massless - in hindsight, one could even state that there actually is no fundamental reason for
why they should be (unlike the photon, for which there exists a very good reason that its mass must be zero). In
practical calculations where $\nu$-oscillations are not of relevance, however, we will often approximate their mass as
zero, which still yields excellent results.


II. CONSERVATION LAWS AND SYMMETRIES 

Learning goals. After reading this chapter, the student should: 

• Understand the relation between continuous symmetries and conserved quantities. 

• Be able to explain what a symmetry and a symmetry group is, and account for how isospin, parity, and 
charge conjugation symmetries work. 

• Explain the difference between SU(2) and SO(3) and in which physical settings these groups are used. 


A. Decays and conserved quantities 

A general property of elementary particles is their tendency to decay: 

Every particle decays into lighter particles, unless prevented by some conservation law.

Neutrinos and photons are stable due to their low and vanishing mass, respectively: there is simply nothing lighter
to decay into. Strictly speaking, however, there is a finite probability for a process of the type $\nu_i \to \nu_j + \gamma$ if
the mass eigenstates for the neutrinos have masses that satisfy $m_{\nu_i} > m_{\nu_j}$. The electron is the lightest charged
particle, so that conservation of charge makes it stable. The proton is the lightest baryon, so that conservation of
baryon number saves it. Baryon number $A$ is assigned so that baryons have $A = +1$ (neutrons, protons, etc.)
whereas antibaryons have $A = -1$ (non-baryons have $A = 0$). The positron and antiproton are also stable for the
same reasons, but all other particles may spontaneously disintegrate. Even quarks decay: this is e.g. what happens
in the weak decay $n \to p + e^- + \bar{\nu}_e$. A given decay is governed by one of the three fundamental forces we have
discussed:

• $\Delta^{++} \to p + \pi^+$ is a strong decay.

• $\pi^0 \to \gamma + \gamma$ is an electromagnetic decay.

• $\Sigma^- \to n + e^- + \bar{\nu}_e$ is a weak decay.

How can we know this? If a photon comes out, the process is certainly electromagnetic. If a neutrino emerges,
the process is certainly weak. If neither $\gamma$ nor $\nu$ emerges, it is more tricky to determine the origin. For instance,
$\Sigma^- \to n + \pi^-$ is weak, but $\Delta^- \to n + \pi^-$ is strong. The reason for this is that in the $\Sigma^-$ decay strangeness is
changed, which does not happen in strong interactions. Weak interactions can conserve strangeness, however, and
in that case one can look at the decay time in order to distinguish the weak and strong interactions. This is in fact
the experimentally most dramatic difference between the various types of decays:

Type of decay    Typical lifetime

Strong           ~ $10^{-23}$ s

EM               ~ $10^{-16}$ s

Weak             Ranges from $10^{-13}$ s (for $\tau$) to several minutes (for $n$)

A decay of a given type in general proceeds more rapidly the larger the mass difference is between the original
particle and the decay products (although exceptions exist). It is this kinematic effect that causes the great spread
in the weak interaction lifetimes. The definition of the lifetime $\tau$ is related to the half-life $t_{1/2}$ via


$t_{1/2} = (\ln 2)\tau \approx 0.693\tau.$ (2.1)

In turn, the half-life is the time required for half the particles in a large sample to decay.
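
The relation between lifetime and half-life can be verified with a few lines of code. The sketch assumes the standard exponential decay law $N(t) = N(0)e^{-t/\tau}$, which is not written out in the text above; the function names and the sample lifetime (roughly that of the muon) are illustrative assumptions.

```python
import math


def half_life(lifetime):
    """t_half = ln(2) * tau, as in Eq. (2.1)."""
    return math.log(2) * lifetime


def surviving_fraction(t, lifetime):
    """N(t)/N(0) = exp(-t / tau), the exponential decay law assumed here."""
    return math.exp(-t / lifetime)


tau = 2.2e-6  # seconds, roughly the muon lifetime (illustrative value)
t_half = half_life(tau)

# After one half-life, half of a large sample should remain.
print(t_half, surviving_fraction(t_half, tau))
```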

Different types of conservation laws exist which dictate whether or not a reaction is possible. The kinematic 
conservation laws are concerned with energy, momentum and angular momentum. These apply to all interactions:
strong, weak, and electromagnetic. In addition, there are further conservation laws that apply to each vertex in
a Feynman diagram. A quantity conserved at each vertex is automatically conserved for the total reaction.

1. Conservation of charge. Note that the $W^\pm$ bosons can carry away charge.

2. Conservation of color. Electromagnetic and weak interactions do not affect color, whereas gluons can carry 
away color. 

3. Conservation of baryon number. As mentioned before, $A = +1$ for baryons, $A = -1$ for antibaryons,
and $A = 0$ for everything else. The origin of this rule can be traced back to conservation of quark number.

4. Conservation of lepton generation number. Strong interactions do not influence leptons, whereas electromagnetic
interactions only make the particle emit a photon. Weak interactions mix leptons within the same
generation ($e$, $\mu$, $\tau$).

5. Conservation of flavor in strong and electromagnetic interactions. Flavor is not conserved in weak 
interactions. 
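
Two of the rules in this list, conservation of charge and of baryon number, can be checked mechanically for a proposed reaction. The following is a minimal sketch; the particle table covers only a handful of particles, and the labels and function names are assumptions made for illustration. Passing this check does not guarantee that a reaction occurs, since the other conservation laws and the kinematics must also be satisfied.

```python
# (electric charge Q, baryon number A) for a few particles
PARTICLES = {
    "p": (1, 1),
    "n": (0, 1),
    "e-": (-1, 0),
    "e+": (1, 0),
    "nu_e": (0, 0),
    "anti-nu_e": (0, 0),
    "pi+": (1, 0),
    "pi-": (-1, 0),
    "pi0": (0, 0),
    "gamma": (0, 0),
}


def total(particles, index):
    """Sum charge (index 0) or baryon number (index 1) over a list."""
    return sum(PARTICLES[p][index] for p in particles)


def conserves_charge_and_baryon_number(initial, final):
    return all(total(initial, i) == total(final, i) for i in (0, 1))


# Neutron beta decay: allowed by both rules.
print(conserves_charge_and_baryon_number(["n"], ["p", "e-", "anti-nu_e"]))
# p -> e+ + pi0: charge is fine, but baryon number changes from 1 to 0.
print(conserves_charge_and_baryon_number(["p"], ["e+", "pi0"]))
```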

It is also worth mentioning that unlike leptons and baryons, there is no conservation of mesons (a quark-antiquark
pair $q\bar{q}$).


B. The concept of symmetry and symmetry groups

Symmetry is an immensely powerful concept in physics and mathematics. Consider for instance the following 
graph. 



Even if we do not know the functional form of $f(x)$, we know a symmetry property it has, namely $f(x) = -f(-x)$.
It then follows immediately that, for instance, $[f(-x)]^4 = [f(x)]^4$ and that $\frac{df}{dx}\big|_{x=2} = \frac{df}{dx}\big|_{x=-2}$. We actually know
a lot about the mathematical properties of $f(x)$ simply by observing that it is antisymmetric. The concept of
symmetry has profound implications in particle dynamics. A milestone was reached in 1918 when Emmy Noether
published her famous theorem:


Noether's theorem: Continuous symmetry $\leftrightarrow$ Conservation law.


We will assume that the reader has been introduced to this theorem previously and not dwell further upon it, despite
the fact that it is very interesting. If more details regarding its formal application to important scenarios such as
translational or rotational symmetry are desired, the reader can have a look here. Some examples are shown below.

Symmetry                 Conservation law

Translation in time      Energy

Translation in space     Momentum

Rotation                 Angular momentum

Gauge transformation     Charge (to be explained)

Now, what precisely is a symmetry? It is an operation you can perform on a system that leaves it invariant. Consider 
the following example. 


[Figure: an equilateral triangle with vertices A, B, and C.]



An equilateral triangle is carried into itself by clockwise rotation by $2\pi/3$ (an operation we denote $R_+$) or
counter-clockwise rotation ($R_-$) by the same angle. Moreover, if we flip it about the axis Aa ($R_a$) or the corresponding
axes through B and C ($R_b$ and $R_c$), it is also left invariant. A combination of these operations would also
be a symmetry operation. Mathematically, a set of symmetry operations must have the following properties:

1. Closure. If $R_i$ and $R_j$ are in the set, then the product $R_iR_j$ is also in that set.

2. Identity. There is an element $I$ such that $IR_i = R_iI = R_i$ for all elements $R_i$.

3. Inverse. For every element $R_i$, there is an inverse $R_i^{-1}$ such that $R_iR_i^{-1} = R_i^{-1}R_i = I$.

4. Associativity. $R_i(R_jR_k) = (R_iR_j)R_k$.

These are the defining properties of a group. Note that commutativity of the elements is not required, so that
$R_iR_j \neq R_jR_i$ is allowed, in which case we have a non-Abelian group.
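
The four group properties can be verified explicitly for the symmetry operations of the equilateral triangle. In the sketch below each operation is represented as a permutation of the three vertices (index 0, 1, 2 for A, B, C). This encoding, the element labels and the choice of which rotation is called `R+` are assumptions made for illustration.

```python
from itertools import product

# Each operation maps vertex index i to ELEMENTS[name][i].
ELEMENTS = {
    "I": (0, 1, 2),
    "R+": (1, 2, 0),   # rotation by 2*pi/3
    "R-": (2, 0, 1),   # rotation by 2*pi/3 in the opposite direction
    "Ra": (0, 2, 1),   # flip keeping A fixed
    "Rb": (2, 1, 0),   # flip keeping B fixed
    "Rc": (1, 0, 2),   # flip keeping C fixed
}


def compose(p, q):
    """Product pq: apply q first, then p."""
    return tuple(p[q[i]] for i in range(3))


group = set(ELEMENTS.values())
identity = ELEMENTS["I"]

closure = all(compose(p, q) in group for p, q in product(group, repeat=2))
has_identity = all(compose(identity, p) == p == compose(p, identity) for p in group)
has_inverses = all(
    any(compose(p, q) == identity == compose(q, p) for q in group) for p in group
)
associative = all(
    compose(p, compose(q, r)) == compose(compose(p, q), r)
    for p, q, r in product(group, repeat=3)
)
abelian = all(compose(p, q) == compose(q, p) for p, q in product(group, repeat=2))

print(closure, has_identity, has_inverses, associative, abelian)
```

In this encoding a rotation followed by a flip differs from the same flip followed by the rotation, so the triangle group is an example of a non-Abelian group.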


In physics, most of the groups of interest are groups of matrices. Of particular importance is the U(n) group,
meaning all unitary $n \times n$ matrices $U$ (which thus satisfy $U^{-1} = U^\dagger$). Orthogonal matrices are a special case,
namely unitary matrices with real entries.

• SU(n): unitary matrices with determinant +1.

• O(n): real unitary matrices (O stands for orthogonal).

• SO(n): real orthogonal matrices with determinant +1.

The SO(3) group, which we will treat in more detail in the next section, describes rotations in three dimensions 
and is mathematically quite similar to the SU(2) group. An important aspect of groups is that: 

Every group can be represented by a group of matrices. 

This means that for every group element $a$, there is a corresponding matrix $M_a$. This correspondence respects
group multiplication: if $ab = c$, then $M_a M_b = M_c$. There may be several group elements represented by the same
matrix. One then says that the group of matrices is homomorphic, but not necessarily isomorphic, to G. If G itself
is a group of matrices, such as SU(4), then it is a representation of itself (referred to as the fundamental
representation). There can, however, be many other representations by matrices in various dimensions. For instance, a trivial
example is that every element can be represented by the $1 \times 1$ unit matrix (although it is not a very interesting
representation). The group SU(2) has representations of dimension 1 (the trivial one just mentioned), dimension 2
(the fundamental representation), dimensions 3, 4, 5, and so forth. One can always construct a new representation
by combining two old ones in block-diagonal form:


$$M_a = \begin{pmatrix} M_a^{(1)} & 0 \\ 0 & M_a^{(2)} \end{pmatrix} \qquad (2.2)$$
However, such representations are not counted separately: instead, one usually only lists irreducible
representations, which cannot be decomposed in this manner.


C. Introduction to group theory: SU(2) vs. SO(3) 


These two groups are of particular importance since they are related to both rotations of spins (half-integer as well 
as integer) and to internal symmetries between particles and in Lagrangians describing relevant quantum fields. 
Let us start by introducing some terminology. 


Isomorphism. A one-to-one correspondence between elements $G$ of a group $\mathcal{G}$ and $G'$ of a group $\mathcal{G}'$ such that if
$G_i G_j = G_k$, then $G'_i G'_j = G'_k$.


Homomorphism. Let $f$ be a mapping that maps the element $G$ of $\mathcal{G}$ onto $G'$ of $\mathcal{G}'$: $G' = f(G)$. If
$f(G_i G_j) = f(G_i) f(G_j)$ holds for any two elements, $f$ is a homomorphic mapping.

Note that homomorphism signifies an n-to-one correspondence between the elements of the groups, in general. A
homomorphism with n = 1 is thus an isomorphism.


Example 1. Equilateral triangle. The allowed symmetry operations constitute a group $C_{3v}$:

$$C_{3v} = \{E, C_3, C_3^{-1}, \sigma_1, \sigma_2, \sigma_3\}. \qquad (2.3)$$

Here, $C_3$ means a counterclockwise rotation by an angle $2\pi/3$ around the center while $\sigma_j$ means mirror reflection
about symmetry axis $j$. The multiplication table of a group provides information about how the product of two
group elements is related to an element in the group. For this particular group, the multiplication table is shown in
the figure.


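(b) Multiplication table for $C_{3v}$ (entries $G_j \circ G_i$, with $G_j$ labelling the rows and $G_i$ the columns)

G_j \ G_i    E          C_3        C_3^{-1}   σ_1        σ_2        σ_3

E            E          C_3        C_3^{-1}   σ_1        σ_2        σ_3
C_3          C_3        C_3^{-1}   E          σ_3        σ_1        σ_2
C_3^{-1}     C_3^{-1}   E          C_3        σ_2        σ_3        σ_1
σ_1          σ_1        σ_2        σ_3        E          C_3        C_3^{-1}
σ_2          σ_2        σ_3        σ_1        C_3^{-1}   E          C_3
σ_3          σ_3        σ_1        σ_2        C_3        C_3^{-1}   E

(c) Multiplication table for $C_2$

             E          C

E            E          C
C            C          E
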

Compare this group with $C_2 = \{E, C\}$, where $C$ is a rotation by $\pi$ so that $C^2 = E$. Let the following define the
mapping $f$:

$$\{E, C_3, C_3^{-1}\} \to E, \qquad \{\sigma_1, \sigma_2, \sigma_3\} \to C. \qquad (2.4)$$

This fulfills the homomorphism criterion, as seen from the multiplication table in part (c) of the figure. As seen,
there are three elements in $C_{3v}$ corresponding to one element in $C_2$: this is a homomorphism, not an isomorphism.
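
The homomorphism criterion $f(G_i G_j) = f(G_i) f(G_j)$ can be verified for all 36 products at once. The following is a rough sketch, assuming that the triangle symmetries are modelled as permutations of three corner labels, so that rotations are even permutations and reflections are odd ones, and that the $C_2$ elements E and C are encoded as the numbers +1 and -1. These encodings and the helper names are illustrative choices, not notation from the text.

```python
from itertools import permutations, product

elements = list(permutations(range(3)))  # the six symmetry operations

def compose(g, h):
    """Return the product g*h: apply h first, then g."""
    return tuple(g[h[i]] for i in range(3))

def sign(g):
    """+1 for even permutations (rotations), -1 for odd ones (reflections)."""
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if g[i] > g[j])
    return -1 if inversions % 2 else 1

# f maps rotations to E (encoded as +1) and reflections to C (encoded as -1).
# Multiplying +1 and -1 reproduces the C_2 table: C*C = E, E*C = C.
f = {g: sign(g) for g in elements}

is_homomorphism = all(f[compose(g, h)] == f[g] * f[h]
                      for g, h in product(elements, repeat=2))
preimage_sizes = {value: sum(1 for g in elements if f[g] == value) for value in (1, -1)}

print(is_homomorphism, preimage_sizes)
```

Three group elements land on each element of $C_2$, which is why the mapping is three-to-one rather than one-to-one. Read as numbers, the values +1 and -1 also form a one-dimensional matrix representation of the group.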


The above considerations are related to representations of a group. Let $\mathcal{G}$ be a group with elements $G_i$ and associate
a square matrix $D(G_i)$ with each element. If the matrices satisfy $D(G_j) D(G_i) = D(G_k)$ for the corresponding
relation $G_j G_i = G_k$, then the set of matrices $D(G_i)$ is a representation of $\mathcal{G}$. The mapping

$$D: G_i \to D(G_i) \qquad (2.5)$$

is then homomorphic as the same matrix may be associated with several elements. As mentioned before, a group
can have representations in several dimensions.

Example 2. 1D representation of $C_{3v}$. The following one-dimensional matrices (scalars) constitute a
representation of $C_{3v}$ and thus reproduce the multiplication table in the figure above.

$$D(E) = 1, \quad D(C_3) = 1, \quad D(C_3^{-1}) = 1, \quad D(\sigma_1) = -1, \quad D(\sigma_2) = -1, \quad D(\sigma_3) = -1. \qquad (2.6)$$


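Consider now the rotation group in 3D, described by a matrix depending on three angles $\{\alpha, \beta, \gamma\}$:

$$R = \begin{pmatrix} c_\alpha & -s_\alpha & 0 \\ s_\alpha & c_\alpha & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} c_\beta & 0 & s_\beta \\ 0 & 1 & 0 \\ -s_\beta & 0 & c_\beta \end{pmatrix} \begin{pmatrix} c_\gamma & -s_\gamma & 0 \\ s_\gamma & c_\gamma & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad (2.7)$$

We introduced the short-hand notation $s_\alpha = \sin\alpha$, $c_\alpha = \cos\alpha$, and similarly for $\beta$ and $\gamma$. As long as only proper rotations are
considered (and thus not inversions), R is a $3 \times 3$ orthogonal matrix with $\det(R) = +1$. Thus, the set of rotation
matrices forms the group SO(3). The range of the parameters is important when discussing the relation of this
group to SU(2), as we will now see. A possible representation of SO(3) in two dimensions is given by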


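$$D^{(1/2)}[R(\alpha, \beta, \gamma)] = \begin{pmatrix} e^{-i(\alpha+\gamma)/2} \cos\frac{\beta}{2} & -e^{-i(\alpha-\gamma)/2} \sin\frac{\beta}{2} \\ e^{i(\alpha-\gamma)/2} \sin\frac{\beta}{2} & e^{i(\alpha+\gamma)/2} \cos\frac{\beta}{2} \end{pmatrix} \qquad (2.8)$$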


We see that $D^{(1/2)}$ is a special unitary $2 \times 2$ matrix. The word "special" means that the determinant is +1 (a general
unitary matrix only has a determinant of unit modulus), and the superscript indicates that the group SU(2) is of importance for rotation of spin-1/2 particles.
Hence, the matrices given in the equation above form the group SU(2) if the parameter range of $(\alpha, \beta, \gamma)$ is such
that all possible SU(2) matrices can be produced. To proceed, notice that $R(0, 0, 0)$ and $R(0, 0, 2\pi)$ both
correspond to the same identity operation. However, $D^{(1/2)}(0, 0, 0) = -D^{(1/2)}(0, 0, 2\pi)$. In effect, both the
matrix $D^{(1/2)}(R)$ and $-D^{(1/2)}(R)$ correspond to R. This seems to indicate that SU(2) would be a double-valued
representation of SO(3), meaning that there are two elements in SU(2) that can represent one element in SO(3).
This is not allowed for a true representation as it introduces an ambiguity. We can nevertheless resolve this issue
by considering precisely the parameter range for the angles in $D^{(1/2)}$. In order for these matrices to comprise
SU(2), we need for instance $0 \le \gamma < 4\pi$. However, by restricting the range to $0 \le \gamma < 2\pi$, $D^{(1/2)}$ can represent
SO(3). Note that in this way, it is no longer the full group SU(2), but we have removed the double-valuedness
by restricting the parameter range. We can also turn the argument around. Since there are two elements in SU(2)
corresponding to each element of SO(3), the groups are homomorphic:


SO(3) is a representation of SU(2). 


This is not an isomorphism (one-to-one correspondence) and SU(2) is not a representation of SO(3). 
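
The sign flip under a $2\pi$ rotation is easy to check numerically. The sketch below evaluates the matrix of Eq. (2.8) with the standard library only. The function names `d_half`, `det` and `is_unitary` and the sample angles are illustrative assumptions, not notation from the text.

```python
import cmath
import math

def d_half(alpha, beta, gamma):
    """Spin-1/2 rotation matrix of Eq. (2.8) as a nested list."""
    c, s = math.cos(beta / 2), math.sin(beta / 2)
    return [
        [cmath.exp(-1j * (alpha + gamma) / 2) * c, -cmath.exp(-1j * (alpha - gamma) / 2) * s],
        [cmath.exp(1j * (alpha - gamma) / 2) * s, cmath.exp(1j * (alpha + gamma) / 2) * c],
    ]

def det(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def is_unitary(m, tol=1e-12):
    """Check that M times its conjugate transpose is the identity."""
    for i in range(2):
        for j in range(2):
            entry = sum(m[i][k] * m[j][k].conjugate() for k in range(2))
            if abs(entry - (1 if i == j else 0)) > tol:
                return False
    return True

# R(0,0,0) and R(0,0,2*pi) are the same rotation, but D^(1/2) differs by a sign.
a = d_half(0.0, 0.0, 0.0)
b = d_half(0.0, 0.0, 2 * math.pi)
sign_flip = all(abs(a[i][j] + b[i][j]) < 1e-12 for i in range(2) for j in range(2))

# Arbitrary sample angles: the matrix should be unitary with determinant +1.
sample = d_half(0.3, 1.1, 2.5)
print(sign_flip, is_unitary(sample), abs(det(sample) - 1) < 1e-12)
```

All three checks hold: the matrix is special unitary for generic angles, and extending $\gamma$ by $2\pi$ multiplies it by -1, so two SU(2) matrices sit above each rotation.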


We here summarize some general properties related to the groups SU(2) and SO(3).

• The representations of SU(2) may be classified by a number $j = 0, 1/2, 1, 3/2, \ldots$ and the dimensionality of the
representations is $2j + 1$.

• The spin-$j$ representation of SU(2) is $\{D^{(j)}\}$, where $D^{(j)}$ are rotation matrices for angular momentum $j = 0, 1/2, 1, \ldots$

j = half-integer: the representation is faithful to SU(2), for instance like $j = 1/2$ in our above treatment.
j = integer: the representation is also a representation for SO(3), and faithful as long as $j \neq 0$.
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D. Isospin 


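The close similarity between the proton and neutron in terms of their mass ($m_p \approx 938.3$ MeV/$c^2$ and $m_n \approx 939.6$
MeV/$c^2$) caused Heisenberg to suggest that they could be regarded as two states of a single particle: the nucleon
N. To explore such an idea, let us write

$$N = \begin{pmatrix} \alpha \\ \beta \end{pmatrix}, \quad \text{where} \quad p = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad n = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \qquad (2.9)$$
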


so that the proton and neutron can be characterized by their isospin I. This is in analogy to the treatment of spinors 
which have the same structure, only that their components are a part of spin-space. The physics behind this idea 
is that the strong interactions should be invariant under rotations in isospin space, just like electrical forces are 
invariant under rotations in ordinary space. Put differently, Heisenberg proposed that if the charge of the proton 
was somehow "turned off", it should be identical to the neutron. Noether’s theorem dictates that isospin should 
then be conserved in all strong interactions, just like angular momentum is conserved in processes with rotational 
invariance. 


Formulated in terms of our recently discussed group theory, strong interactions should then be invariant under an
internal symmetry group SU(2) and nucleons belong to the two-dimensional representation with isospin $I = 1/2$. Note
that since $m_p$ is not exactly equal to $m_n$, SU(2) isospin symmetry should not be expected to be an exact symmetry,
but it should be very good. Isospin may also be used to classify other hadrons besides nucleons.
The motivation for this is the same as for n and p: similar masses, but different charges.
For the pions one assigns $I = 1$, so that the different pion particles are represented as different states in isospin
space:

$$\pi^+ = |1, 1\rangle, \quad \pi^0 = |1, 0\rangle, \quad \pi^- = |1, -1\rangle. \qquad (2.10)$$


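The $\Lambda$ has $I = 0$, so that $\Lambda = |0, 0\rangle$, whereas the $\Delta$ particles have $I = 3/2$:

$$\Delta^{++} = |\tfrac{3}{2}, \tfrac{3}{2}\rangle, \quad \Delta^{+} = |\tfrac{3}{2}, \tfrac{1}{2}\rangle, \quad \Delta^{0} = |\tfrac{3}{2}, -\tfrac{1}{2}\rangle, \quad \Delta^{-} = |\tfrac{3}{2}, -\tfrac{3}{2}\rangle. \qquad (2.11)$$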


The number of particles $r$ in a multiplet of hadrons is related to the isospin by $r = 2I + 1$. The third component
$I_3$ is determined by the charge, so that $I_3 = I$ for the highest charge.


Isospin also has dynamical implications rather than simply being used as a tool to classify particles. Consider the
(isospin) angular momentum of two nucleons. Their total value can be $I = 0$ or $I = 1$, according to standard rules
for how to add two angular momenta. In particular, we have three possible triplet states:

$$|1, 1\rangle = pp, \quad |1, 0\rangle = \frac{1}{\sqrt{2}}(pn + np), \quad |1, -1\rangle = nn, \qquad (2.12)$$

and one singlet state

$$|0, 0\rangle = \frac{1}{\sqrt{2}}(pn - np). \qquad (2.13)$$
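
That these four combinations really carry total isospin 1 and 0 can be confirmed by applying the total isospin operator $I^2$ to them, whose eigenvalues are $I(I+1)$. The sketch below builds $I^2$ for two nucleons from single-nucleon operators in the basis (p, n). The basis ordering pp, pn, np, nn and all helper names are assumptions made for this illustration.

```python
import math

# Single-nucleon isospin operators in the basis (p, n).
# Only I_3 and the ladder operators I_+ and I_- are needed.
I3 = [[0.5, 0.0], [0.0, -0.5]]
Ip = [[0.0, 1.0], [0.0, 0.0]]   # raises n -> p
Im = [[0.0, 0.0], [1.0, 0.0]]   # lowers p -> n
ID = [[1.0, 0.0], [0.0, 1.0]]

def kron(a, b):
    """Kronecker product of two 2x2 matrices, giving a 4x4 matrix."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

def add(*ms):
    return [[sum(m[i][j] for m in ms) for j in range(4)] for i in range(4)]

def scale(c, m):
    return [[c * x for x in row] for row in m]

def apply(m, v):
    return [sum(m[i][j] * v[j] for j in range(4)) for i in range(4)]

# Total I^2 = I1^2 + I2^2 + 2 I1.I2, with I1^2 = I2^2 = 3/4 and
# 2 I1.I2 = 2 (I3 x I3) + (I+ x I-) + (I- x I+).
I2_total = add(scale(1.5, kron(ID, ID)), scale(2.0, kron(I3, I3)),
               kron(Ip, Im), kron(Im, Ip))

r = 1 / math.sqrt(2)
# Two-nucleon basis order: pp, pn, np, nn.
states = {
    "|1,1>": [1, 0, 0, 0],
    "|1,0>": [0, r, r, 0],
    "|1,-1>": [0, 0, 0, 1],
    "|0,0>": [0, r, -r, 0],
}

def eigenvalue(v):
    """Return the I^2 eigenvalue of v, or None if v is not an eigenvector."""
    w = apply(I2_total, v)
    lam = sum(wi * vi for wi, vi in zip(w, v))
    ok = all(abs(wi - lam * vi) < 1e-12 for wi, vi in zip(w, v))
    return round(lam, 12) if ok else None

for name, v in states.items():
    print(name, eigenvalue(v))
```

The triplet states give $I(I+1) = 2$ and the antisymmetric combination gives 0, in line with the labels in Eqs. (2.12) and (2.13).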


Now, the n and p form a bound state known as a deuteron (d). We can then immediately conclude that it has to be
a singlet state in isospin space. Why? If it were a triplet state, then pp and nn should also occur in nature simply
by rotating the isospin. However, such long-lived states are not known to exist (to be precise, nn was actually
observed as an intermediate state in 2012, which shows that isospin is not a perfect symmetry). It is possible to
extend the concept of isospin further to particles such as the $\Sigma$'s and $\Xi$'s, which have spin quantum number $s = 1/2$
and somewhat similar masses. Nevertheless, it is certainly a stretch to call all of these particles different states of
one single particle. SU(2) isospin symmetry is a good, but not exact, symmetry.


It is instructive to compare the isospin classification of $\pi$ and $K$. There are three pions: $\pi^+$, $\pi^0$, $\pi^-$. All have
similar mass and they fit nicely into an $I = 1$ multiplet. As for kaons, there are 4 of them: $K^+$, $K^-$, $K^0$, $\bar{K}^0$. Note
that $K^0$ and $\bar{K}^0$ are distinct (unlike $\pi^0$ which is its own antiparticle). Now, the question is: should the kaons be a
$I = 3/2$ or $I = 1/2$ multiplet? In principle, we could assign them as follows (sticking with the notation $|I, I_3\rangle$):

$$K^+ = |\tfrac{3}{2}, \tfrac{3}{2}\rangle, \quad K^0 = |\tfrac{3}{2}, \tfrac{1}{2}\rangle, \quad \bar{K}^0 = |\tfrac{3}{2}, -\tfrac{1}{2}\rangle, \quad K^- = |\tfrac{3}{2}, -\tfrac{3}{2}\rangle. \qquad (2.14)$$


If we do, however, the charge is not descending with $I_3$ as it should. Moreover, there is a more serious problem:
the kaons have different strangeness quantum numbers. The $K^+$ and $K^0$ have $S = +1$ as they consist of $u\bar{s}$ and
$d\bar{s}$, respectively. The $K^-$ and $\bar{K}^0$ have $S = -1$, as they consist respectively of $\bar{u}s$ and $\bar{d}s$. The whole idea
with isospin is that the particles are supposed to interact strongly in the same way, and this cannot be the case
above since strangeness is conserved in strong interactions. Therefore, we must assign the kaons into two isospin
$I = 1/2$ doublets:

$$K^+ = |\tfrac{1}{2}, \tfrac{1}{2}\rangle, \quad K^0 = |\tfrac{1}{2}, -\tfrac{1}{2}\rangle, \quad \bar{K}^0 = |\tfrac{1}{2}, \tfrac{1}{2}\rangle, \quad K^- = |\tfrac{1}{2}, -\tfrac{1}{2}\rangle, \qquad (2.15)$$

even if all have similar masses.
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E. Charge conjugation and parity 
Charge conjugation. 

Classical electrodynamics is invariant under a sign change of charge (the forces remain the same). In particle
physics, the generalized notion of this symmetry is the charge conjugation C:

$$C|p\rangle = |\bar{p}\rangle \qquad (2.16)$$

where $|p\rangle$ represents a general particle and not necessarily a proton. The effect of the C operator is thus that
it changes the sign of charge, baryon and lepton number, strangeness, and additional quantum numbers associated
with quarks. It leaves mass, energy, momentum, and spin untouched (and thus also the helicity of a particle). We
see that $C^2 = I$, which means that the eigenvalues of C are ±1. Only particles that are their own antiparticles are
eigenstates of C (such as the photon). However, a composite system such as a spin-1/2 particle-antiparticle pair $p\bar{p}$ with relative
orbital angular momentum $l$ and total spin $s$ is also an eigenstate of C with eigenvalue $(-1)^{l+s}$. Mesons are
examples of this. C is a multiplicative quantum number and conserved in strong and electromagnetic interactions,
whereas it is not conserved in weak interactions.


Example 3. Pion decay. The process $\pi^0 \to \gamma + \gamma$ is allowed since $C_{\pi^0} = +1$ and $C = (-1)^n$ for n photons.
However, $\pi^0 \to \gamma + \gamma + \gamma$ would not be allowed via the electromagnetic interaction since C is not conserved in
this process. A weak interaction could mediate this process, but its amplitude would be very small.


Can $\gamma$ decay to multiple photons?

According to our discussion on charge conjugation, we see that $\gamma \to \gamma + \gamma$ violates C-symmetry. In fact, it is not
even kinematically possible since the photon has no rest frame. Note that the process is not possible despite the
fact that we can draw a Feynman diagram for it:



The existence of a Feynman diagram is thus no guarantee for the viability of a decay or scattering process,
similarly to the primitive vertices. What about $\gamma \to \gamma + \gamma + \gamma$? It seems fine in terms of C-symmetry, but the only
way that $\gamma$ could decay into an odd number of photons is if all particles are moving collinearly in order to satisfy
energy and momentum conservation. Computation of the actual Feynman diagram (which we will learn how to do
later) nevertheless gives zero probability of this process to all orders perturbatively.


Parity. 

Prior to 1956, physicists generally believed that nature had a parity symmetry: letting $\mathbf{r} \to -\mathbf{r}$ in any physical
process should also yield a physically permissible process. Lee and Yang found that there was strong experimental
evidence of parity invariance in the strong and electromagnetic processes, but not when it came to the weak
interactions. Lee and Yang initiated a famous experiment, which was conducted by Wu and collaborators, that
consisted of the setup shown in (a).


[Figure: (a) $^{60}$Co with spin aligned along the z-axis and the preferred direction of the emitted electrons; (b) the parity-reversed process.]

The spins of radioactive $^{60}$Co were aligned with the z-axis through application of a magnetic field. Upon decaying,
the $^{60}$Co would emit electrons that turned out to escape mostly in one direction, as shown in (a). Based simply on
this observation, the conclusion was that parity must be broken! If we let $\mathbf{r} \to -\mathbf{r}$, the parity-reversed process
is obtained as shown in (b), but the experiment demonstrated that this does not occur. Recall that spin (angular
momentum) is invariant under a parity transformation (for instance, $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ does not change since both $\mathbf{r}$ and
$\mathbf{p}$ change sign under parity).

In fact, P violation appears to be prominent in weak interactions. This is exemplified via the neutrino. Define first
the helicity of a particle as $m_s/s$ where the z-axis is aligned with the direction of motion:

[Figure: positive helicity (right-handed) and negative helicity (left-handed).]

Earlier experiments showed that all neutrinos seemed to be left-handed while all anti-neutrinos were right-handed.
Consider for instance the pion decay $\pi^- \to \mu^- + \bar{\nu}_\mu$. If $\pi^-$ decays from rest (in the CM frame), the $\mu^-$ and
$\bar{\nu}_\mu$ spins must be opposite since $s_{\pi^-} = 0$ and angular momentum is conserved:

[Figure: spin orientations of the $\mu^-$ and $\bar{\nu}_\mu$ emitted in $\pi^-$ decay at rest.]

As seen, the helicities must then be the same, and experimentally one measures the muon to be right-handed in
the $\pi^-$ rest frame. Now, if the neutrino were truly massless, its helicity would have been the same in all inertial
frames since then $v_\nu = c$. But if neutrinos do have mass (which we now know that they do), then $v_\nu < c$ and we
could in principle find a reference frame S moving at a speed $v_\nu < v_S < c$. In this frame, the neutrino would be
right-handed and parity symmetry would be restored. However, this has not yet been observed.

The parity operator P acts differently on different mathematical quantities. We have $P\mathbf{r} = -\mathbf{r}$. However, $P\mathbf{L} = \mathbf{L}$
since $P\mathbf{p} = -\mathbf{p}$. We thus have

• $P\mathbf{v} = -\mathbf{v}$ for a vector (also known as polar vector).

• $P\mathbf{a} = \mathbf{a}$ for a pseudovector (also known as axial vector).

• $Ps = s$ for a scalar.

• $Pp = -p$ for a pseudoscalar.

Note that we can construct a pseudoscalar from the triple product of three polar vectors, $\mathbf{a} \cdot (\mathbf{b} \times \mathbf{c})$.
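
These transformation properties follow from counting sign flips, which can be checked directly. In the sketch below, the three vectors are arbitrary illustrative polar vectors and the helper names are assumptions. The parity operation is applied to the polar vectors only, and the derived quantities are then recomputed.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def parity(v):
    """Parity acting on a polar vector: v -> -v."""
    return tuple(-x for x in v)

# Arbitrary illustrative polar vectors (integer components keep the arithmetic exact).
r, p, c = (1, 2, 3), (-4, 0, 5), (2, -1, 7)

L = cross(r, p)                              # angular momentum, an axial vector
L_reflected = cross(parity(r), parity(p))    # built from the parity-transformed r and p

scalar = dot(r, p)
scalar_reflected = dot(parity(r), parity(p))

triple = dot(r, cross(p, c))                 # pseudoscalar built from three polar vectors
triple_reflected = dot(parity(r), cross(parity(p), parity(c)))

print(L_reflected == L, scalar_reflected == scalar, triple_reflected == -triple)
```

Two sign flips cancel in the cross product and in the dot product, while the triple product picks up three sign flips and therefore changes sign.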

Like C, P is multiplicative and conserved in strong and electromagnetic interactions. It is not conserved in weak
interactions. The parity group has two elements: $I$ and $P$ (since $P^2 = I$). The eigenvalues are ±1 and particles
such as hadrons are eigenstates of P, so that they may be classified according to their eigenvalue. Quantum field
theory dictates that the parity of a fermion must be opposite to that of the corresponding antiparticle. For bosons, on the
other hand, particle and antiparticle parity is the same. Spin-1/2 fermions have intrinsic positive parity (quarks,
leptons, nucleons). In composite systems, parity is multiplicative. Excited states acquire an extra factor $(-1)^l$
where $l$ is the orbital angular momentum. The photon, being a vector particle, has parity -1. The terms "pseudo" and
"vector" in reference to a particle indicate the parity eigenvalue of the particle.

F. CP-violation and the TCP theorem 

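Although C and P are not individually conserved in weak interactions, could it be that their product CP is? To
investigate this, consider the process $K^0 \leftrightarrow \bar{K}^0$ which is possible via a second-order weak interaction:

[Figure: two diagrams for $K^0 \leftrightarrow \bar{K}^0$ conversion, each involving the exchange of two W bosons.]

These are two of the contributing diagrams. Due to the interconversion, what is experimentally observed is usually
some linear combination of $K^0$ and $\bar{K}^0$. We have

$$P|K^0\rangle = -|K^0\rangle, \quad P|\bar{K}^0\rangle = -|\bar{K}^0\rangle, \qquad (2.17)$$

since these are pseudoscalar particles. Moreover,

$$C|K^0\rangle = |\bar{K}^0\rangle, \quad C|\bar{K}^0\rangle = |K^0\rangle. \qquad (2.18)$$

It follows that $CP|K^0\rangle = -|\bar{K}^0\rangle$ and $CP|\bar{K}^0\rangle = -|K^0\rangle$. Using these properties, we may then construct the
normalized eigenstates of CP as:

$$CP|K_1\rangle = |K_1\rangle, \quad CP|K_2\rangle = -|K_2\rangle, \qquad (2.19)$$

where we defined the states

$$|K_1\rangle = \frac{1}{\sqrt{2}}\left(|K^0\rangle - |\bar{K}^0\rangle\right), \quad |K_2\rangle = \frac{1}{\sqrt{2}}\left(|K^0\rangle + |\bar{K}^0\rangle\right). \qquad (2.20)$$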


If CP is a conserved quantity, then $K_1$ ($K_2$) can only decay to a state with $CP = +1$ ($-1$). As a consequence,
$K_1 \to 2\pi$ and $K_2 \to 3\pi$ are allowed, but $K_2 \to 2\pi$ should be impossible. Imagine then that we start the
experiment with a beam of $K^0$ produced by an accelerator:

$$|K^0\rangle = \frac{1}{\sqrt{2}}\left(|K_1\rangle + |K_2\rangle\right). \qquad (2.21)$$
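
The algebra behind the CP eigenstates is a two-dimensional eigenvalue problem, which the following sketch checks numerically. It assumes the basis ordering ($|K^0\rangle$, $|\bar{K}^0\rangle$) and uses the action of CP that follows from Eqs. (2.17) and (2.18). The helper names are illustrative.

```python
import math

# Basis: (|K0>, |K0bar>). From Eqs. (2.17)-(2.18): CP|K0> = -|K0bar>, CP|K0bar> = -|K0>.
CP = [[0, -1], [-1, 0]]

def apply(m, v):
    return [sum(m[i][j] * v[j] for j in range(2)) for i in range(2)]

def close(u, v, tol=1e-12):
    return all(abs(a - b) < tol for a, b in zip(u, v))

r = 1 / math.sqrt(2)
K1 = [r, -r]   # (|K0> - |K0bar>)/sqrt(2)
K2 = [r, r]    # (|K0> + |K0bar>)/sqrt(2)

k1_is_cp_even = close(apply(CP, K1), K1)
k2_is_cp_odd = close(apply(CP, K2), [-x for x in K2])

# Rebuild |K0> = (1, 0) from the CP eigenstates, as in Eq. (2.21).
K0_rebuilt = [r * (a + b) for a, b in zip(K1, K2)]

print(k1_is_cp_even, k2_is_cp_odd, close(K0_rebuilt, [1, 0]))
```

The check confirms that a pure $K^0$ beam is an equal mixture of the CP-even and CP-odd states.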


The $2\pi$ decay is faster than the $3\pi$ decay since the released energy is much greater. Thus, for a long beam trajectory only
$3\pi$ events should be observed at the end. However, an experiment by Cronin and Fitch in 1964 showed that $2\pi$
events did occur at the end of the beam, indicating violation of CP. The long-lived kaon state $|K_L\rangle$ should then
read:

$$|K_L\rangle = \frac{1}{\sqrt{1 + |\epsilon|^2}}\left(|K_2\rangle + \epsilon|K_1\rangle\right), \qquad (2.22)$$

since it apparently is not a perfect eigenstate of CP, where $\epsilon$ is a small number. Kaons are typically produced in
strong interactions as eigenstates of strangeness, but they decay via the weak interaction. Once created, they are
therefore better thought of as a superposition of weak eigenstates $|K_1\rangle$ and $|K_2\rangle$.

What about time-reversal symmetry T? Should the laws of physics work equally well when reversing time? It 
seems not, due to the TCP-theorem from quantum field theory. It is derived only from general assumptions, such 
as Lorentz invariance, and states 
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The combined operation of T, C, and P is an exact symmetry of any interaction. 

Therefore, if CP is violated, T should also be violated. If this theorem is correct, one can prove that every particle 
should have exactly the same mass and lifetime as its antiparticle. So far, no experiment has proven otherwise. 
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III. THEORY OF RELATIVISTIC KINEMATICS 

Learning goals. After reading this chapter, the student should: 

• Be able to perform calculations using 4-vectors and Lorentz-transformations. 

• Understand the basic principles of the special theory of relativity and how this influences particle decay and 
scattering. 


We shall here establish the basic principles, notation, and terminology of relativistic kinematics. This will be 
crucial in order for us to develop the mathematical foundation for particle dynamics in later chapters of this book. 


A. Lorentz transformations 


We will assume here that the reader has been introduced to Einstein's special theory of relativity in previous
courses, for instance in classical mechanics, and thus only briefly revisit some of the key insights here. For a more
detailed introduction to this topic, the reader can have a look here. The special theory of relativity states that all
laws of physics are equally valid in all inertial systems. An inertial system is defined as a system where Newton's
first law is obeyed, namely that objects move along straight lines with constant speed unless acted upon by a force.
Consider then two inertial frames S and S' where S' is moving at constant velocity v relative to S:

[Figure: inertial frames S and S' with parallel axes, S' moving along the x-axis relative to S.]


Assume that the origins of the frames coincide at $t = t' = 0$. A Lorentz transformation establishes the relation
between an event as seen in S at coordinates $(x, y, z, t)$ and the same event as seen in S' at coordinates $(x', y', z', t')$:

$$x' = \gamma(x - vt), \quad y' = y, \quad z' = z, \quad t' = \gamma(t - vx/c^2). \qquad (3.1)$$

Here, we introduced

$$\gamma = \frac{1}{\sqrt{1 - v^2/c^2}}. \qquad (3.2)$$


The inverse set of transformations (going from S' to S) is obtained by interchanging primed and unprimed coordinates and letting $v \to -v$. The
transformation rules can be derived by demanding that the speed of light c should be the same in both inertial
frames. This set of transformations has a number of consequences:

1. The relativity of simultaneity. If two events occur at the same time in S, but at different locations, then
they do not occur at the same time in S'. Namely, if $t_A = t_B$, then $t'_A = t'_B + \frac{\gamma v}{c^2}(x_B - x_A)$.

2. Lorentz contraction. An object of length L' as measured in S' has a length $L = L'/\gamma$ when measured in
S. The moving object is thus shortened by a factor $\gamma$, which applies to lengths along the direction of motion
(dimensions perpendicular to the motion are unaffected).

3. Time dilation. A clock at the origin of S' ticking an interval T' will be seen to have ticked an interval
$t = \gamma T'$ by an observer in S. Thus, one may state that moving clocks run slower. This is important with
respect to particle physics since each unstable particle has its own "built-in" clock: a moving particle lasts
longer than it would at rest. Note that tabulated lifetimes for particles always refer to the particle's rest frame.
Due to time dilation, cosmic ray muons produced in the upper atmosphere make it to ground level even if
their lifetime in the rest frame is not long enough to do so.



4. Velocity addition. Suppose that a particle is moving in the x-direction at speed u' with respect to S'.
We then have $u' = \Delta x'/\Delta t'$. What is the speed of the particle with respect to S? It moves a distance
$\Delta x = \gamma(\Delta x' + v\Delta t')$ in a time $\Delta t = \gamma[\Delta t' + (v/c^2)\Delta x']$, so that

$$u = \frac{\Delta x}{\Delta t} = \frac{(\Delta x'/\Delta t') + v}{1 + (v/c^2)(\Delta x'/\Delta t')} = \frac{u' + v}{1 + u'v/c^2}. \qquad (3.3)$$

The classical, non-relativistic answer from a Galilei transformation would be $u = u' + v$, so the correction
from relativity is in the denominator. It only matters if u' and v are close to c. Note that $u' = c$ gives $u = c$.


B. 4-vectors 


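Define the position four-vector $x^\mu$, $\mu = 0, 1, 2, 3$:

$$x^0 = ct, \quad x^1 = x, \quad x^2 = y, \quad x^3 = z. \qquad (3.4)$$

The Lorentz transformation may then be written compactly:

$$x'^\mu = \sum_{\nu=0}^{3} \Lambda^\mu_{\ \nu} x^\nu, \qquad (3.5)$$

where $\Lambda^\mu_{\ \nu}$ are the coefficients of the matrix $\Lambda$ (with $\beta = v/c$):

$$\Lambda = \begin{pmatrix} \gamma & -\gamma\beta & 0 & 0 \\ -\gamma\beta & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \qquad (3.6)$$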


so that for instance $\Lambda^0_{\ 0} = \gamma$, $\Lambda^0_{\ 1} = -\gamma\beta$, and so forth. We will from now on use Einstein's sum convention for Greek
indices, which means that for repeated indices a summation is implicitly understood. The Lorentz transformation
is then written simply as $x'^\mu = \Lambda^\mu_{\ \nu} x^\nu$. It can be shown that the particular combination:

$$I = (x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2 \qquad (3.7)$$

has the same value in all inertial systems, so that $I$ is also equal to $(x'^0)^2 - (x'^1)^2 - (x'^2)^2 - (x'^3)^2$. Therefore,
$I$ must be a Lorentz invariant as it remains the same under a Lorentz transformation, analogously to how $r^2 =
x^2 + y^2 + z^2$ is invariant under rotations in three-dimensional space. To keep track of the different signs in $I$, we
introduce the metric $g_{\mu\nu}$, which are the components of the matrix $g$:

$$g = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix} \qquad (3.8)$$


We may then express $I$ as a double sum: $I = g_{\mu\nu} x^\mu x^\nu$. From this, we may define the covariant four-vector $x_\mu$
(index down):

$$x_\mu = g_{\mu\nu} x^\nu. \qquad (3.9)$$

It follows that $x_0 = x^0$ and $x_i = -x^i$, $i = 1, 2, 3$. For index up, we have a contravariant four-vector $x^\mu$:

$$x^\mu = g^{\mu\nu} x_\nu. \qquad (3.10)$$

We thus have:

Index up ($x^\mu$): contravariant vector. Index down ($x_\mu$): covariant vector.

The metric $g$ can thus be thought of as raising or lowering the index of a four-vector. One way to memorize this is
the following: contravariant vectors have the index on top (note the "t" in "contra" and in "top"). Our invariant $I$ can now be written even more compactly
as

$$I = x_\mu x^\mu. \qquad (3.11)$$

This type of notation also generalizes to non-Cartesian coordinate systems and curved spaces encountered in the
theory of general relativity. We define a four-vector $a^\mu$ formally as an object that transforms in the same way as
$x^\mu$ when going from one inertial system to another:

$$a'^\mu = \Lambda^\mu_{\ \nu} a^\nu. \qquad (3.12)$$

We also associate a covariant four-vector $a_\mu$ to each such contravariant vector:

$$a_\mu = g_{\mu\nu} a^\nu. \qquad (3.13)$$

According to our definition above, we go from covariant to contravariant four-vectors via $a^\mu = g^{\mu\nu} a_\nu$. Here, $g^{\mu\nu}$
are the elements of the matrix $g^{-1}$. For our case, we see that $g = g^{-1}$ so that $g_{\mu\nu} = g^{\mu\nu}$. For any two four-vectors
$a^\mu$ and $b^\mu$, the quantity $a_\mu b^\mu = a \cdot b$ is invariant. We will sometimes also simply write $a_\mu b^\mu = ab$ when there
is no risk for confusion. Notation-wise, we may distinguish between time and spatial coordinates in this invariant
since $a_\mu b^\mu = a^0 b^0 - \mathbf{a} \cdot \mathbf{b}$. Note that $a^2 = a_\mu a^\mu$ need not be positive:

• If $a^2 > 0$, then $a^\mu$ is said to be timelike.

• If $a^2 < 0$, then $a^\mu$ is said to be spacelike.

• If $a^2 = 0$, then $a^\mu$ is said to be lightlike.

Tensors transform in a generalized way compared to vectors: $s'^{\mu\nu} = \Lambda^\mu_{\ \kappa} \Lambda^\nu_{\ \sigma} s^{\kappa\sigma}$. Note that any tensor of rank n + 2
can be contracted to a tensor of rank n by performing a summation over upper and lower indices. For instance, $s^\mu_{\ \mu}$
is a scalar while $t^{\mu\nu}_{\ \ \ \nu}$ is a four-vector, and so forth.
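
The invariance of $a \cdot b$ and the timelike/spacelike/lightlike classification can be illustrated numerically. The sketch below stores four-vectors as plain tuples, uses the diagonal of the metric in Eq. (3.8), and boosts along x with the matrix of Eq. (3.6). The sample vectors, the boost speed $\beta = 0.8$ and the function names are illustrative assumptions.

```python
import math

METRIC = (1, -1, -1, -1)  # diagonal of g, Eq. (3.8)

def dot4(a, b):
    """Invariant product a.b = a^0 b^0 - (spatial dot product)."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))

def boost_x(a, beta):
    """Apply the Lorentz matrix of Eq. (3.6) for a boost along x with speed beta*c."""
    gam = 1.0 / math.sqrt(1.0 - beta**2)
    return (gam * (a[0] - beta * a[1]), gam * (a[1] - beta * a[0]), a[2], a[3])

def classify(a, tol=1e-12):
    """Label a four-vector by the sign of a^2."""
    a2 = dot4(a, a)
    if a2 > tol:
        return "timelike"
    if a2 < -tol:
        return "spacelike"
    return "lightlike"

a = (5.0, 1.0, 2.0, 3.0)
b = (2.0, 4.0, 0.0, -1.0)
a_p, b_p = boost_x(a, 0.8), boost_x(b, 0.8)

print(dot4(a, b), dot4(a_p, b_p))
print(classify(a), classify(b), classify((1.0, 1.0, 0.0, 0.0)))
```

The two printed products agree up to rounding, even though the individual components of both vectors change under the boost.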
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C. Energy and momentum 

We previously stated that time dilation refers to the phenomenon that a moving clock measures a proper time (eigentime) $d\tau$
whereas a stationary observer measures $dt$. The two are related via

$$d\tau = dt/\gamma. \qquad (3.14)$$

The quantity $\tau$ is an invariant: all observers agree on the time that is measured in the particle's rest frame, even if
the time measured in their own frame might differ. Just as the 4-position vector gives an invariant $x_\mu x^\mu$, the 4-velocity:

$$\eta^\mu = dx^\mu/d\tau = \gamma(c, \mathbf{v}) \qquad (3.15)$$

gives an invariant when contracted: $\eta_\mu \eta^\mu = \gamma^2(c^2 - v^2) = c^2$. Note that if we had defined relativistic momentum
as $m\mathbf{v}$, it would not have provided conservation of momentum in all inertial systems, assuming that it was valid in
one such system. However, $m\boldsymbol{\eta}$, with $\boldsymbol{\eta} = \gamma\mathbf{v}$ the spatial part of the 4-velocity, assures that this holds. Thus, the appropriate 4-momentum vector reads:

$$p^\mu = m\eta^\mu = (\gamma mc, \gamma m\mathbf{v}). \qquad (3.16)$$

Let us define the relativistic energy as $E = \gamma mc^2$. Then, it follows that $p_\mu p^\mu = E^2/c^2 - \mathbf{p}^2 = E^2/c^2 - \gamma^2 m^2 v^2 =
m^2 c^2$. We see that the relativistic momentum reduces to its classical counterpart when $v \ll c$ since $\gamma \to 1$ then.
However, for the energy we obtain

$$E = mc^2\left(1 + \frac{1}{2}\frac{v^2}{c^2} + \frac{3}{8}\frac{v^4}{c^4} + \ldots\right) \qquad (3.17)$$


from which it is seen that an extra constant term $mc^2$ exists in the limit $v \ll c$. This term survives even in the
absence of any kinetic energy, $v = 0$. It is the so-called rest energy of the particle. The remainder of the energy
must then be associated with the kinetic content: $E_{\text{kinetic}} = (\gamma - 1)mc^2$. An aspect which has no counterpart in
non-relativistic mechanics is that a massless particle with non-zero momentum may exist: $E = |\mathbf{p}|c$. The fact that
this is possible is demonstrated by the formulae

$$\mathbf{p} = m\mathbf{v}/\sqrt{1 - v^2/c^2} \quad \text{and} \quad E = mc^2/\sqrt{1 - v^2/c^2}, \qquad (3.18)$$

since $m \to 0$ still can give a finite $\mathbf{p}$ and $E$ if simultaneously $v \to c$.
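
The relations of this section can be checked against each other numerically. The sketch below works in illustrative units with c = 1 and unit mass, both of which are assumptions made for convenience. It tests the invariant $E^2 - p^2c^2 = m^2c^4$, compares the exact energy with the truncated series of Eq. (3.17) at low speed, and shows how $E \to |\mathbf{p}|c$ as the mass is taken to zero at fixed momentum.

```python
import math

def gamma(beta):
    return 1.0 / math.sqrt(1.0 - beta**2)

def energy_momentum(m, beta, c=1.0):
    """Return (E, |p|) for mass m and speed beta*c, following Eq. (3.18)."""
    g = gamma(beta)
    return g * m * c**2, g * m * beta * c

m, c = 1.0, 1.0  # units with c = 1 and unit mass, chosen for illustration

# Invariant mass relation E^2 - p^2 c^2 = m^2 c^4 at an arbitrary speed.
E, p = energy_momentum(m, 0.6)
invariant = E**2 - (p * c) ** 2

# Low-speed expansion, Eq. (3.17): E ~ mc^2 (1 + beta^2/2 + 3 beta^4/8).
beta = 0.01
E_exact, _ = energy_momentum(m, beta)
E_series = m * c**2 * (1 + beta**2 / 2 + 3 * beta**4 / 8)

# Massless limit: p stays fixed while m -> 0, so E -> |p| c and v -> c.
p_fixed = 1.0
for mass in (1.0, 1e-2, 1e-4):
    E_m = math.sqrt((p_fixed * c) ** 2 + (mass * c**2) ** 2)
    print(mass, E_m, (p_fixed * c**2) / E_m)   # mass, energy, speed v = p c^2 / E

print(invariant, E_exact - E_series)
```

The difference between the exact energy and the series is of order $\beta^6$, which is why it is tiny at $\beta = 0.01$.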


D. Collisions 

Energy and momentum are, as in non-relativistic mechanics, conserved quantities in particle collisions. This may
be compactly expressed via the 4-momentum vector: $p^\mu_{\text{total}}$ is a conserved quantity. Mass, however, is not necessarily
conserved. The crucial concept here is that rest energy (and thus mass) may be converted into kinetic energy or vice
versa. To obtain consistency with the non-relativistic limit, where kinetic energy is converted into some internal 
form of energy (such as heat), we need only realize that all such internal energies are reflected in the rest energy 
of the body. On a macroscopic scale, the rest energy is much greater than the internal energy, so that the added 
mass from internal energy is completely negligible. Strictly speaking, however, a hot potato weighs more than a 
cold potato. Imagine two objects that have identical masses when at the same temperature. If one of the objects is 
heated, so that its internal temperature increases, its weight will also become larger than the other object according 
to our reasoning above. 


Example 4. Particles merging. Consider the following scenario where two particles merge into one via a head-on 
collision. 

[Figure: before the collision, two particles of mass m approach head-on with velocities +v and -v; after the collision, a single particle of mass M remains.]

The task is to find the mass M of the final particle when $|\mathbf{v}| = 3c/5$. Conservation of momentum gives $\mathbf{p}_1 = -\mathbf{p}_2$,
so the total momentum vanishes and the final particle is at rest. For the energy, we obtain

$$Mc^2 = 2E_m = 2mc^2/\sqrt{1 - v^2/c^2} = 5mc^2/2. \qquad (3.19)$$

Thus, we find $M = 5m/2 > 2m$, meaning that we have a so-called sticky collision where the mass has increased.
Note that the reverse process, M decaying into two particles with mass m each, is only possible if $M > 2m$. A
deuteron (bound state of a p and n) weighs less than $m_p + m_n$. Therefore, it will not decay unless energy is injected.
This is actually a concrete example of how the (negative) binding energy is reflected in the total rest mass.
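
The arithmetic of Example 4 can be redone exactly with rational numbers. The sketch below is only illustrative: the function name `final_mass` is an assumption, masses are measured in units of m, and the exact square root works only when $1 - v^2/c^2$ is a perfect square of a rational number, as it is for v = 3c/5.

```python
from fractions import Fraction
from math import isqrt

def final_mass(m, beta):
    """Mass M of the merged particle for two masses m colliding head-on at speed beta*c."""
    one_minus = 1 - beta**2                      # exact rational 1 - v^2/c^2
    num, den = one_minus.numerator, one_minus.denominator
    root = Fraction(isqrt(num), isqrt(den))      # exact only if both are perfect squares
    if root**2 != one_minus:
        raise ValueError("1 - beta^2 is not a perfect square; use floating point instead")
    return 2 * m / root                          # M = 2 m gamma, from M c^2 = 2 E_m

M = final_mass(Fraction(1), Fraction(3, 5))
print(M)
```

For other speeds, replacing the exact root by `math.sqrt` gives a floating-point version of the same formula.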


Example 5. Pion decay. A pion at rest decays into a muon and a neutrino. What is the speed of the muon? Let us 
start with the conservation laws at hand: 

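$$E_\pi = E_\mu + E_\nu, \qquad \mathbf{p}_\pi = \mathbf{p}_\mu + \mathbf{p}_\nu \;\Rightarrow\; \mathbf{p}_\mu = -\mathbf{p}_\nu. \tag{3.20}$$

One solution strategy is to find the energy of a particle when you know its momentum, by using the invariant
$E^2 - \mathbf{p}^2c^2 = m^2c^4$. When the energy has been identified in this way, use that $E = \gamma mc^2$ and $\mathbf{p} = \gamma m\mathbf{v}$, which
gives $\mathbf{v} = \mathbf{p}c^2/E$. For this example, we have (treating the neutrino as massless)

$$E_\pi = m_\pi c^2, \qquad E_\mu = c\sqrt{m_\mu^2c^2 + \mathbf{p}_\mu^2}, \qquad E_\nu = |\mathbf{p}_\nu|c = |\mathbf{p}_\mu|c. \tag{3.21}$$

This provides $|\mathbf{p}_\mu| = (m_\pi^2 - m_\mu^2)c/2m_\pi$ and $E_\mu = (m_\pi^2 + m_\mu^2)c^2/2m_\pi$. Plugging these expressions into
$v = pc^2/E$ then provides the muon velocity, $v = c\,(m_\pi^2 - m_\mu^2)/(m_\pi^2 + m_\mu^2)$.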
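
To put numbers on Example 5, the sketch below evaluates the expressions for the muon momentum, energy and speed. It assumes approximate rest energies of 139.57 MeV for the charged pion and 105.66 MeV for the muon, and a massless neutrino; the function name is only illustrative.

```python
def muon_from_pion_decay(m_pi: float, m_mu: float):
    """Return (|p| c, E, v/c) of the muon from a pion decaying at rest.

    Masses are given as rest energies (e.g. in MeV), so that |p| c and E
    come out in the same unit.
    """
    pc = (m_pi**2 - m_mu**2) / (2.0 * m_pi)
    energy = (m_pi**2 + m_mu**2) / (2.0 * m_pi)
    beta = pc / energy  # v/c = p c / E
    return pc, energy, beta


M_PI = 139.57  # MeV, approximate charged-pion rest energy (assumed input)
M_MU = 105.66  # MeV, approximate muon rest energy (assumed input)

pc, energy, beta = muon_from_pion_decay(M_PI, M_MU)
print(f"|p|c = {pc:.2f} MeV, E = {energy:.2f} MeV, v/c = {beta:.3f}")
```

With these inputs the muon emerges with roughly 27% of the speed of light, so it is only mildly relativistic.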

We end this chapter by emphasizing the difference between a conserved quantity and an invariant quantity. Energy
is conserved (same value before and after collision), but it is not an invariant: the energy can have different values 
in different inertial frames. On the other hand, mass is invariant (same value in all inertial systems), but it is not a 
conserved quantity in general (before and after a collision). 
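
The distinction can be made concrete with the sticky collision of Example 4. The sketch below (units with $c = 1$, one spatial dimension, illustrative function names) boosts the two incoming particles from the centre-of-momentum frame to the rest frame of one of them: the total energy differs between the frames, whereas the invariant mass of the system is the same in both, and it exceeds the sum $2m$ of the initial rest masses.

```python
import math


def boost(energy: float, momentum: float, beta: float):
    """Lorentz boost along x (c = 1) to a frame moving with velocity beta."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return gamma * (energy - beta * momentum), gamma * (momentum - beta * energy)


def totals(particles):
    """Total energy and momentum of a list of (E, p) pairs."""
    return sum(e for e, _ in particles), sum(p for _, p in particles)


def invariant_mass(energy: float, momentum: float) -> float:
    return math.sqrt(energy**2 - momentum**2)


m, beta = 1.0, 0.6
gamma = 1.0 / math.sqrt(1.0 - beta**2)
incoming = [(gamma * m, gamma * m * beta), (gamma * m, -gamma * m * beta)]

e_cm, p_cm = totals(incoming)
e_lab, p_lab = totals([boost(e, p, beta) for e, p in incoming])

print("total energy:", e_cm, "vs", e_lab)  # frame dependent
print("invariant mass:", invariant_mass(e_cm, p_cm), "vs", invariant_mass(e_lab, p_lab))
print("sum of initial rest masses:", 2 * m)  # smaller than the final mass
```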

IV. QUANTUM ELECTRODYNAMICS AND FEYNMAN RULES

Learning goals. After reading this chapter, the student should: 

• Be able to work with the Klein-Gordon and Dirac equations mathematically and understand what their
solutions represent.

• Be able to compute the Feynman amplitude $\mathcal{M}$ for Feynman diagrams in quantum electrodynamics, and also
be able to obtain decay rates and scattering cross sections from $\mathcal{M}$.

• Understand the principle behind renormalization and how it is related to higher-order Feynman diagrams. 


Quantum electrodynamics (QED) is a major cornerstone in particle physics and serves as the foundation for much of
the material that we will look at later. We will here introduce the Dirac equation and its solutions, Feynman rules
and how they work specifically in QED, in addition to some pragmatic calculational tools.


A. The Dirac equation 

In non-relativistic quantum mechanics (QM), it is the Schrödinger equation which is the fundamental equation that
describes the behavior of physical systems. In relativistic QM, one has different fundamental equations depending
on the spin of the particle.

• Spin-0: Klein-Gordon equation.

• Spin-1/2: Dirac equation.

• Spin-1: Proca equation.

We will later in this chapter establish a set of Feynman rules which will allow us to evaluate Feynman diagrams
mathematically. Once this set of rules has been established, the underlying field equation for a particle with a
given spin is no longer immediately needed. However, the notation of Feynman rules for spin-1/2 particles does
require that we are familiar with the Dirac equation and we therefore treat it, and its solutions, here in detail.

It is instructive to first consider how the Schrödinger equation (SE) can be motivated. If one starts out with the
classical, non-relativistic expression for energy, $E = \mathbf{p}^2/2m + V$, and then substitutes the operators

$$\mathbf{p} \to \frac{\hbar}{i}\nabla, \qquad E \to i\hbar\partial_t, \tag{4.1}$$

one ends up with precisely the time-dependent SE:

$$-\frac{\hbar^2}{2m}\nabla^2\Psi + V\Psi = i\hbar\partial_t\Psi. \tag{4.2}$$

The same principle can be applied for the Klein-Gordon (KG) equation. The starting point is the classical,
relativistic expression for energy, $E^2 - \mathbf{p}^2c^2 = m^2c^4$, or $p^\mu p_\mu - m^2c^2 = 0$. Consider a free particle to begin with
such that $V = 0$. Now, perform the same substitution as in the SE case in order to introduce operators,

$$p_\mu \to i\hbar\partial_\mu, \tag{4.3}$$

which is simply a more compact way of writing Eq. (4.1). We introduced $\partial_\mu = \partial/\partial x^\mu$. In effect, this means that

$$(E/c, -\mathbf{p}) \to i\hbar(\partial_t/c, \nabla). \tag{4.4}$$

The KG equation now becomes:

$$-\hbar^2\partial^\mu\partial_\mu\Psi - m^2c^2\Psi = 0, \tag{4.5}$$

which alternatively can be written as:

$$-\frac{1}{c^2}\frac{\partial^2\Psi}{\partial t^2} + \nabla^2\Psi = \left(\frac{mc}{\hbar}\right)^2\Psi. \tag{4.6}$$

In the presence of an EM potential, we have to take into account the influence of the electric scalar potential $\phi$
and the magnetic vector potential $\mathbf{A}$. This is done in the usual manner: the energy is changed to $E \to E - e\phi$ while
the momentum is augmented by the magnetic vector potential via $\mathbf{p} \to \mathbf{p} - e\mathbf{A}/c$ in order for the physics to be
gauge-invariant (the so-called minimal coupling). In this case, we obtain

$$(i\hbar\partial_t - e\phi)^2\Psi = c^2\left(-i\hbar\nabla - \frac{e\mathbf{A}}{c}\right)^2\Psi + m^2c^4\Psi. \tag{4.7}$$

Note that while the SE is first order in the time derivative, $\mathcal{O}(\partial_t)$, the KG equation is second order, $\mathcal{O}(\partial_t^2)$.
As we now turn our attention to the Dirac equation, it is also worth emphasizing that $\Psi$ is a scalar field and that
any solution to the Dirac equation is also a solution of the KG equation, as there is no reference to spin in the KG
equation. One can loosely think of the KG equation as describing the magnitude of the field, whereas the Dirac
equation describes both the magnitude and the "direction" (spin) of the field in the case of spin-1/2 particles.
Note that the opposite is not true: a solution of the KG equation is not necessarily a solution of the Dirac
equation, since the KG equation has lost a degree of freedom (spin) compared to the Dirac equation.

Now, in order to obtain the Dirac equation, the strategy is to factor the $E$-$\mathbf{p}$ relation (often called a dispersion
relation). If $\mathbf{p} = 0$, this is easy:

$$(p^0)^2 - m^2c^2 = 0 = (p^0 + mc)(p^0 - mc) \;\Rightarrow\; (p^0 - mc) = 0 \ \text{ or } \ (p^0 + mc) = 0. \tag{4.8}$$

Both options guarantee that $p^\mu p_\mu - m^2c^2 = 0$. For the more interesting case $\mathbf{p} \neq 0$, we would need something
like:

$$(p^\mu p_\mu - m^2c^2) = (\beta^\kappa p_\kappa + mc)(\gamma^\lambda p_\lambda - mc), \tag{4.9}$$

where $\beta^\kappa$ and $\gamma^\lambda$ are undetermined coefficients for now. Multiplying out the right-hand side, we obtain

$$\beta^\kappa\gamma^\lambda p_\kappa p_\lambda - mc(\beta^\kappa - \gamma^\kappa)p_\kappa - m^2c^2. \tag{4.10}$$

If $\beta^\kappa = \gamma^\kappa$, the linear terms in momentum are seen to cancel. Now, we must find $\gamma^\kappa$ so that the second order terms
also match:

$$p^\mu p_\mu = \gamma^\kappa\gamma^\lambda p_\kappa p_\lambda. \tag{4.11}$$

The problem with this equation is that it is impossible to satisfy for scalars $\gamma^\kappa$. However, it can be satisfied if the
$\gamma$-quantities are matrices instead. Writing out Eq. (4.11) more explicitly, we obtain

$$(p^0)^2 - (p^1)^2 - (p^2)^2 - (p^3)^2 = (\gamma^0)^2(p^0)^2 + (\gamma^1)^2(p^1)^2 + (\gamma^2)^2(p^2)^2 + (\gamma^3)^2(p^3)^2$$
$$+ (\gamma^0\gamma^1 + \gamma^1\gamma^0)p_0p_1 + \ldots + (\gamma^2\gamma^3 + \gamma^3\gamma^2)p_2p_3. \tag{4.12}$$

We need a set of matrices to get rid of the cross terms, and we thus require that the matrices have the following
properties in order to accomplish this:

$$(\gamma^0)^2 = 1, \qquad (\gamma^1)^2 = (\gamma^2)^2 = (\gamma^3)^2 = -1, \tag{4.13}$$

in addition to

$$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 0 \ \text{ for } \ \mu \neq \nu. \tag{4.14}$$

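It is clear why scalars could not satisfy the above properties: scalars commute. The above requirements can be
written quite compactly as:

$$\{\gamma^\mu, \gamma^\nu\} = 2g^{\mu\nu}, \qquad g^{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & -1 & 0 & 0\\ 0 & 0 & -1 & 0\\ 0 & 0 & 0 & -1 \end{pmatrix}, \tag{4.15}$$

where $g^{\mu\nu}$ is thus the Minkowski metric. There exists an infinite number of physically equivalent sets of $\gamma$-matrices that
satisfy these properties. The smallest ones are $4 \times 4$ and take the explicit form:

$$\gamma^0 = \begin{pmatrix} 1 & 0\\ 0 & -1 \end{pmatrix}, \qquad \gamma^i = \begin{pmatrix} 0 & \sigma^i\\ -\sigma^i & 0 \end{pmatrix}, \tag{4.16}$$

where each entry is a $2 \times 2$ matrix: 1 is the identity, 0 is the zero matrix, and $\sigma^i$ are the Pauli matrices. Now, the
energy-momentum relation factorizes:

$$(p^\mu p_\mu - m^2c^2) = (\gamma^\kappa p_\kappa + mc)(\gamma^\lambda p_\lambda - mc) = 0. \tag{4.17}$$

The conventional choice is now to consider:

$$\gamma^\mu p_\mu - mc = 0. \tag{4.18}$$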

If we let $p_\mu \to i\hbar\partial_\mu$ as for the other quantum equations (thus introducing operators), we obtain the final form of
the Dirac equation:

$$i\hbar\gamma^\mu\partial_\mu\psi - mc\psi = 0.$$

Here, $\psi$ is a Dirac spinor (which is not a 4-vector):

$$\psi = \begin{pmatrix} \psi_1\\ \psi_2\\ \psi_3\\ \psi_4 \end{pmatrix}. \tag{4.19}$$

Since it is not a 4-vector, it does not transform via the ordinary Lorentz transformation when changing inertial system.

We note in passing that the Pauli equation, which is essentially the SE with spin taken into account, is obtained by
considering the Dirac equation in the low-energy (non-relativistic) limit. We underline that spin is not a relativistic
correction: it is a fundamental property of particles whose existence does not rely on the speed at which the
particles move. The Dirac equation thus describes both spin and relativistic effects.
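
The anticommutation relations required of the $\gamma$-matrices can be verified numerically for the explicit $4 \times 4$ representation quoted above. The following is a minimal sketch in plain Python (no external libraries); the helper names `matmul`, `block` and `check_clifford` are only illustrative.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def scale(s, a):
    return [[s * x for x in row] for row in a]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]


I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],      # sigma_x
    [[0, -1j], [1j, 0]],   # sigma_y
    [[1, 0], [0, -1]],     # sigma_z
]

# gamma^0 = diag(1, -1) and gamma^i = [[0, sigma^i], [-sigma^i, 0]] in block form.
GAMMA = [block(I2, Z2, Z2, scale(-1, I2))] + [block(Z2, s, scale(-1, s), Z2) for s in SIGMA]
METRIC = [1, -1, -1, -1]   # diagonal of the Minkowski metric


def check_clifford(tol=1e-12):
    """True if {gamma^mu, gamma^nu} = 2 g^{mu nu} holds for all mu, nu."""
    for mu in range(4):
        for nu in range(4):
            ab = matmul(GAMMA[mu], GAMMA[nu])
            ba = matmul(GAMMA[nu], GAMMA[mu])
            for i in range(4):
                for j in range(4):
                    target = 2 * METRIC[mu] if (mu == nu and i == j) else 0
                    if abs(ab[i][j] + ba[i][j] - target) > tol:
                        return False
    return True


print(check_clifford())
```

The same check can be run on any other candidate set of matrices by replacing the `GAMMA` list.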

Alternative derivation of the Dirac equation.

We saw above that we could fulfil the equation:

$$p^\mu p_\mu - m^2c^2 = (\gamma^\kappa p_\kappa + mc)(\gamma^\lambda p_\lambda - mc) = 0 \tag{4.20}$$

with appropriate $\gamma$-matrices. We then took one of these factors, say $\gamma^\mu p_\mu - mc$, and set it to zero. Assigning
operators and acting on a wavefunction, we got:

$$i\hbar\gamma^\mu\partial_\mu\psi - mc\psi = 0. \tag{4.21}$$

But $AB = 0$ does not imply that $A = 0$ or $B = 0$ when $A$ and $B$ are matrices. Therefore, we are strictly speaking
not allowed to conclude that one of the factors in Eq. (4.20) is zero, even if it seems to give us the correct result.
We therefore give here a more mathematically satisfying derivation which produces the same result.

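Consider the KG equation:

$$\left(\nabla^2 - \frac{1}{c^2}\partial_t^2\right)\psi = \left(\frac{mc}{\hbar}\right)^2\psi. \tag{4.22}$$

If we factorize this equation (which is already in operator form), we get:

$$\left(\nabla^2 - \frac{1}{c^2}\partial_t^2\right)\psi = (A\partial_x + B\partial_y + C\partial_z + iD\partial_t/c)(A\partial_x + B\partial_y + C\partial_z + iD\partial_t/c)\psi$$
$$= \left(\frac{mc}{\hbar}\right)^2\psi \equiv k^2\psi. \tag{4.23}$$

This equation is satisfied if

$$(A\partial_x + B\partial_y + C\partial_z + iD\partial_t/c)\psi = \pm k\psi \tag{4.24}$$

and if the terms $\{A, B, C, D\}$ are chosen so that the cross terms on the r.h.s. of the first line in Eq. (4.23) vanish.
It turns out that this occurs when

$$\{A, B, C\} = i\gamma^k, \qquad D = \gamma^0, \tag{4.25}$$

so that Eq. (4.24) gives the Dirac equation:

$$i\hbar\gamma^\mu\partial_\mu\psi \mp mc\psi = 0. \tag{4.26}$$

This equation can also be brought to another form which is commonly used by multiplying it with $\gamma^0$ (and setting
$\hbar = c = 1$):

$$\gamma^0(\gamma^0p_0 - \boldsymbol{\gamma}\cdot\mathbf{p} - m)\psi = i\partial_t\psi - \gamma^0\boldsymbol{\gamma}\cdot\mathbf{p}\,\psi - m\gamma^0\psi = 0. \tag{4.27}$$

This may be written as:

$$i\partial_t\psi = H_D\psi, \tag{4.28}$$

where we defined the Hamiltonian

$$H_D = \boldsymbol{\alpha}\cdot\mathbf{p} + \beta m \tag{4.29}$$

and $\boldsymbol{\alpha} = \gamma^0\boldsymbol{\gamma}$ and $\beta = \gamma^0$. This form of the equation describing a relativistic spin-1/2 particle also makes it clear
how to include a potential energy $V(\mathbf{r})$, namely $H_D \to H_D + V(\mathbf{r})$ as usual.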


Solutions to the Dirac equation. 

Consider first the simple case where $\psi$ is independent of $\mathbf{r}$: $\partial\psi/\partial x_j = 0$ ($j = x, y, z$). This should be the case for
$\mathbf{p} = 0$, since $p_\mu \to i\hbar\partial_\mu$. The Dirac equation then reads

$$\begin{pmatrix} 1 & 0\\ 0 & -1 \end{pmatrix}\begin{pmatrix} \partial\psi_A/\partial t\\ \partial\psi_B/\partial t \end{pmatrix} = -i\frac{mc^2}{\hbar}\begin{pmatrix} \psi_A\\ \psi_B \end{pmatrix}, \tag{4.30}$$

where $\psi_A = [\psi_1, \psi_2]^T$ and $\psi_B = [\psi_3, \psi_4]^T$. The superscript $T$ denotes matrix transposition. The equations for
$\psi_A$ and $\psi_B$ are decoupled, and the solutions are obtained as:

$$\psi_A(t) = e^{-i(mc^2/\hbar)t}\psi_A(0), \qquad \psi_B(t) = e^{+i(mc^2/\hbar)t}\psi_B(0). \tag{4.31}$$

For a particle at rest, $E = mc^2$, so $\psi_A$ is seen to carry the usual factor $e^{-iEt/\hbar}$. The opposite sign in the exponent
of $\psi_B$, which formally corresponds to a negative energy $E = -mc^2$, is interpreted as describing antiparticles with
positive energy. For instance, $\psi_A$ may represent an electron, in which case $\psi_B$ represents a positron.

The particle and antiparticle parts ($\psi_A$ and $\psi_B$) are each $2 \times 1$ spinors since they have $s = 1/2$. For $\mathbf{p} = 0$, we then have
four independent solutions:

• Electron spin-$\uparrow$: $\psi^{(1)} = e^{-i(mc^2/\hbar)t}[1, 0, 0, 0]^T$.

• Electron spin-$\downarrow$: $\psi^{(2)} = e^{-i(mc^2/\hbar)t}[0, 1, 0, 0]^T$.

• Positron spin-$\uparrow$: $\psi^{(3)} = e^{+i(mc^2/\hbar)t}[0, 0, 1, 0]^T$.

• Positron spin-$\downarrow$: $\psi^{(4)} = e^{+i(mc^2/\hbar)t}[0, 0, 0, 1]^T$.

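We now turn to the more difficult, but much more interesting, case of $\mathbf{p} \neq 0$. We look for solutions to the Dirac
equation in a plane-wave form:

$$\psi(\mathbf{r}, t) = a\,e^{-i(Et - \mathbf{p}\cdot\mathbf{r})/\hbar}\,u(E, \mathbf{p}), \tag{4.32}$$

which in 4-vector notation can be written (more compactly, as usual): $\psi(x) = a\,e^{-ix\cdot p/\hbar}u(p)$. Here, $a$ is a
normalization constant and we must determine $u(p)$. Inserting our ansatz into the Dirac equation produces:
$(\gamma^\mu p_\mu - mc)u = 0$. It is simpler than the original equation because there are no derivatives present. Now, we
see that since

$$\gamma^\mu p_\mu = \gamma^0p^0 - \boldsymbol{\gamma}\cdot\mathbf{p} = \frac{E}{c}\begin{pmatrix} 1 & 0\\ 0 & -1 \end{pmatrix} - \mathbf{p}\cdot\begin{pmatrix} 0 & \boldsymbol{\sigma}\\ -\boldsymbol{\sigma} & 0 \end{pmatrix}, \tag{4.33}$$

it follows that

$$(\gamma^\mu p_\mu - mc)u = \begin{pmatrix} (E/c - mc)u_A - \mathbf{p}\cdot\boldsymbol{\sigma}\,u_B\\ \mathbf{p}\cdot\boldsymbol{\sigma}\,u_A - (E/c + mc)u_B \end{pmatrix}. \tag{4.34}$$

The $(E/c \pm mc)$ terms are implicitly understood to be multiplied by a $2 \times 2$ identity matrix, as dimensionality requires, and we
follow this convention in what follows (i.e. omitting identity matrices where it is clear from the context that such a matrix
should be present). The solution to the above equation is:

$$u_A = \frac{c}{E - mc^2}(\mathbf{p}\cdot\boldsymbol{\sigma})u_B, \qquad u_B = \frac{c}{E + mc^2}(\mathbf{p}\cdot\boldsymbol{\sigma})u_A. \tag{4.35}$$

Combining these equations, we find

$$u_A = \frac{c^2(\mathbf{p}\cdot\boldsymbol{\sigma})^2}{E^2 - m^2c^4}u_A, \tag{4.36}$$

which is seen to be consistent as $(\mathbf{p}\cdot\boldsymbol{\sigma})^2 = \mathbf{p}^2 = (E^2 - m^2c^4)/c^2$. The wavefunctions are then determined up to
a normalization constant:

1. $u_A = \begin{pmatrix} 1\\ 0 \end{pmatrix}$, which gives $u_B = \frac{c}{E + mc^2}\begin{pmatrix} p_z\\ p_x + ip_y \end{pmatrix}$.

2. $u_A = \begin{pmatrix} 0\\ 1 \end{pmatrix}$, which gives $u_B = \frac{c}{E + mc^2}\begin{pmatrix} p_x - ip_y\\ -p_z \end{pmatrix}$.

3. $u_B = \begin{pmatrix} 1\\ 0 \end{pmatrix}$, which gives $u_A = \frac{c}{E - mc^2}\begin{pmatrix} p_z\\ p_x + ip_y \end{pmatrix}$.

4. $u_B = \begin{pmatrix} 0\\ 1 \end{pmatrix}$, which gives $u_A = \frac{c}{E - mc^2}\begin{pmatrix} p_x - ip_y\\ -p_z \end{pmatrix}$.
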

In the first two cases, we have to use $E = +\sqrt{\mathbf{p}^2c^2 + m^2c^4}$ in order to avoid $u_B$ diverging when $\mathbf{p} \to 0$. These
are the particle solutions. In the two last cases, the opposite is true: we must use $E = -\sqrt{\mathbf{p}^2c^2 + m^2c^4}$ to avoid
divergences. These are the antiparticle solutions. Regarding the normalization, there exist different conventions.
A common convention that we shall stick with here is to normalize the above spinors so that $u^\dagger u = 2|E|/c$. The $\dagger$
means conjugate transpose, so that

$$u = \begin{pmatrix} \alpha\\ \beta\\ \gamma\\ \delta \end{pmatrix} \;\Rightarrow\; u^\dagger = [\alpha^*, \beta^*, \gamma^*, \delta^*]. \tag{4.37}$$

Note that none of the choices 1-4 above corresponds immediately to spin-$\uparrow$, $\downarrow$ electrons or positrons, as they
have more than one non-zero entry in the spinor. None of them is in fact an eigenstate of the spin operator

$$\mathbf{S} = \frac{\hbar}{2}\begin{pmatrix} \boldsymbol{\sigma} & 0\\ 0 & \boldsymbol{\sigma} \end{pmatrix}. \tag{4.38}$$

Only for the special case of $p_x = p_y = 0$ will the spinors 1-4 be eigenspinors of $S_z$ and $\mathbf{S}^2$. We noted that
$E = -\sqrt{\mathbf{p}^2c^2 + m^2c^4}$ corresponds to antiparticles with positive energy (since free particles must have positive
energy). It is customary to relabel the antiparticle states to $v$ and flip the sign of the energy and momentum, so
that we can use $E = \sqrt{\mathbf{p}^2c^2 + m^2c^4}$ everywhere without having to worry about whether the energy belongs to a
particle or antiparticle state. The four independent solutions valid for finite momentum then read:
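
$$u^{(1)} = N\begin{pmatrix} 1\\ 0\\ \frac{cp_z}{E + mc^2}\\ \frac{cp_+}{E + mc^2} \end{pmatrix}, \qquad u^{(2)} = N\begin{pmatrix} 0\\ 1\\ \frac{cp_-}{E + mc^2}\\ \frac{-cp_z}{E + mc^2} \end{pmatrix},$$

$$v^{(1)} = N\begin{pmatrix} \frac{cp_-}{E + mc^2}\\ \frac{-cp_z}{E + mc^2}\\ 0\\ 1 \end{pmatrix}, \qquad v^{(2)} = -N\begin{pmatrix} \frac{cp_z}{E + mc^2}\\ \frac{cp_+}{E + mc^2}\\ 1\\ 0 \end{pmatrix}, \tag{4.39}$$

where we defined $N = \sqrt{(|E| + mc^2)/c}$ and $p_\pm = p_x \pm ip_y$. Thus, $u^{(1,2)}$ are two spin states for an electron
with energy $E$ and momentum $\mathbf{p}$ while $v^{(1,2)}$ are two spin states for a positron with energy $E$ and momentum $\mathbf{p}$.
Notice that the $u$-spinors satisfy $(\gamma^\mu p_\mu - mc)u = 0$ while the $v$-spinors satisfy $(\gamma^\mu p_\mu + mc)v = 0$ since we have
reversed the sign of $E$ and $\mathbf{p}$ by convention.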

Plane-wave solutions are particularly useful since they describe particles with specific energies and momenta, 
both quantities that are usually possible to control quite well in experiments. 
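
The finite-momentum spinors can be checked numerically against the momentum-space Dirac equation and the normalization convention $u^\dagger u = 2|E|/c$. The sketch below assumes units with $c = 1$, the $\gamma$-matrix representation used in this chapter, and an arbitrary illustrative momentum; all function names are hypothetical helpers, not part of any library.

```python
import math


def dirac_operator(energy, p, m, sign):
    """4x4 matrix gamma^mu p_mu + sign*m (c = 1) written out in 2x2 blocks:
    [[E + sign*m, -p.sigma], [p.sigma, -E + sign*m]]."""
    px, py, pz = p
    ps = [[pz, px - 1j * py], [px + 1j * py, -pz]]  # p . sigma
    return [
        [energy + sign * m, 0, -ps[0][0], -ps[0][1]],
        [0, energy + sign * m, -ps[1][0], -ps[1][1]],
        [ps[0][0], ps[0][1], -energy + sign * m, 0],
        [ps[1][0], ps[1][1], 0, -energy + sign * m],
    ]


def spinors(p, m):
    """Return E and the four spinors u1, u2, v1, v2 for momentum p and mass m."""
    px, py, pz = p
    energy = math.sqrt(px**2 + py**2 + pz**2 + m**2)
    n = math.sqrt(energy + m)   # normalization N with c = 1
    k = n / (energy + m)        # common factor N / (E + m)
    p_plus, p_minus = px + 1j * py, px - 1j * py
    u1 = [n, 0, k * pz, k * p_plus]
    u2 = [0, n, k * p_minus, -k * pz]
    v1 = [k * p_minus, -k * pz, 0, n]
    v2 = [-k * pz, -k * p_plus, -n, 0]
    return energy, u1, u2, v1, v2


def residual(matrix, vector):
    """Largest component (in magnitude) of matrix @ vector."""
    return max(abs(sum(matrix[i][j] * vector[j] for j in range(4))) for i in range(4))


def norm_sq(vector):
    return sum(abs(x) ** 2 for x in vector)


p, m = (0.3, -0.4, 1.2), 1.0  # arbitrary illustrative values
energy, u1, u2, v1, v2 = spinors(p, m)
for u in (u1, u2):
    print(residual(dirac_operator(energy, p, m, -1), u), norm_sq(u) - 2 * energy)
for v in (v1, v2):
    print(residual(dirac_operator(energy, p, m, +1), v), norm_sq(v) - 2 * energy)
```

Each printed pair should be zero up to rounding errors: the first number tests the Dirac equation, the second the normalization.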

Transformation of Dirac spinors. 

We mentioned previously that a Dirac spinor is not a 4-vector, meaning that it does not transform via the Lorentz 
transformation under a change of inertial system. How does it transform then? The full proof is not given here and 
the reader is referred to e.g. the textbook by Bjorken & Drell. The reader is instead encouraged to try to derive the 
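transformation rule via the following strategy. We want to discover how the solution $\psi$ of the Dirac equation in
one frame, which satisfies

$$i\hbar\gamma^\mu\partial_\mu\psi - mc\psi = 0, \tag{4.40}$$

looks in a different frame, $\psi' = S\psi$, where the Dirac equation reads

$$i\hbar\gamma^\mu\partial'_\mu\psi' - mc\psi' = 0. \tag{4.41}$$

To help us, we know that the differential operators transform as follows under a change of inertial system:

$$\partial'_\mu = \frac{\partial}{\partial x^{\mu\prime}} = \sum_\nu\frac{\partial x^\nu}{\partial x^{\mu\prime}}\,\partial_\nu. \tag{4.42}$$

The result is that:

$$\psi \to \psi' = S\psi, \tag{4.43}$$

where the primed system is moving with speed $v$ in the $x$-direction and

$$S = \begin{pmatrix} a_+ & a_-\sigma_x\\ a_-\sigma_x & a_+ \end{pmatrix}. \tag{4.44}$$

Besides $\gamma = 1/\sqrt{1 - (v/c)^2}$ as usual, we defined

$$a_\pm = \pm\sqrt{\tfrac{1}{2}(\gamma \pm 1)}. \tag{4.45}$$

Now, $\psi^\dagger\psi$ does not transform as a scalar under a change of inertial system because $(\psi^\dagger\psi)' \neq \psi^\dagger\psi$:

$$(\psi^\dagger\psi)' = (\psi')^\dagger\psi' = \psi^\dagger S^\dagger S\psi \tag{4.46}$$

and

$$S^\dagger S = \gamma\begin{pmatrix} 1 & -\frac{v}{c}\sigma_x\\ -\frac{v}{c}\sigma_x & 1 \end{pmatrix} \neq 1. \tag{4.47}$$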

Similarly to a 4-vector, only the proper metric (relative sign between the components) will make a quantity
transform like a scalar. In this case, it turns out to be

$$\bar\psi\psi = \psi^\dagger\gamma^0\psi = |\psi_1|^2 + |\psi_2|^2 - |\psi_3|^2 - |\psi_4|^2, \tag{4.48}$$

which is a relativistic invariant. Here, we introduced the adjoint spinor:

$$\bar\psi = \psi^\dagger\gamma^0. \tag{4.49}$$

It is also of interest to consider the transformation properties of $\psi$ under a parity transformation. Recall how
scalars and pseudoscalars are distinguished based on how they transform under a parity operation. We have seen
that $\bar\psi\psi$ is Lorentz-invariant (transforms as a scalar under a change of inertial system), but how does it transform
under parity? Using a similar strategy as above (working out the parity transformation effect on the differential
operators), one finds that parity has the effect:

$$\psi \to \psi' = \gamma^0\psi. \tag{4.50}$$

It follows that these relations hold for a parity transformation (with $\gamma^5 \equiv i\gamma^0\gamma^1\gamma^2\gamma^3$):

• $\bar\psi\psi$ = scalar.

• $\bar\psi\gamma^5\psi$ = pseudoscalar.

• $\bar\psi\gamma^\mu\psi$ = vector.

• $\bar\psi\gamma^\mu\gamma^5\psi$ = pseudovector.
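
The statement that $\psi^\dagger\psi$ changes under a boost while $\bar\psi\psi$ does not can be illustrated numerically. The sketch below writes the boost matrix $S$ out as a $4 \times 4$ array, with $a_+$ on the diagonal blocks and $a_-\sigma_x$ on the off-diagonal blocks, and applies it to an arbitrary spinor; the spinor components and the speed $v = 0.6c$ are illustrative assumptions.

```python
import math


def boost_matrix(beta):
    """Spinor boost S for relative speed v = beta*c along x."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    a_plus = math.sqrt(0.5 * (gamma + 1.0))
    a_minus = -math.sqrt(0.5 * (gamma - 1.0))
    # Diagonal blocks: a_plus * identity; off-diagonal blocks: a_minus * sigma_x.
    return [
        [a_plus, 0.0, 0.0, a_minus],
        [0.0, a_plus, a_minus, 0.0],
        [0.0, a_minus, a_plus, 0.0],
        [a_minus, 0.0, 0.0, a_plus],
    ]


def transform(s, psi):
    return [sum(s[i][j] * psi[j] for j in range(4)) for i in range(4)]


def density(psi):
    """psi^dagger psi."""
    return sum(abs(x) ** 2 for x in psi)


def scalar(psi):
    """psi-bar psi = |psi_1|^2 + |psi_2|^2 - |psi_3|^2 - |psi_4|^2."""
    return abs(psi[0]) ** 2 + abs(psi[1]) ** 2 - abs(psi[2]) ** 2 - abs(psi[3]) ** 2


psi = [1.0 + 0.5j, 0.2, -0.3j, 0.7]  # arbitrary illustrative spinor
psi_prime = transform(boost_matrix(0.6), psi)

print("psi^dagger psi:", density(psi), "->", density(psi_prime))  # changes
print("psi-bar psi:   ", scalar(psi), "->", scalar(psi_prime))    # unchanged
```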


B. The photon 


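Classical electrodynamics is described by Maxwell's equations (using a Gaussian units convention, but the
procedure below is equally valid for SI units):

$$\text{(i)}\ \nabla\cdot\mathbf{E} = 4\pi\rho, \qquad \text{(ii)}\ \nabla\cdot\mathbf{B} = 0,$$
$$\text{(iii)}\ \nabla\times\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} = 0, \qquad \text{(iv)}\ \nabla\times\mathbf{B} - \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} = \frac{4\pi}{c}\mathbf{J}. \tag{4.51}$$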

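Here, $\rho$ is the charge density and $\mathbf{J}$ is the current density. In relativistic notation, one introduces the
antisymmetric field tensor:

$$F^{\mu\nu} = \begin{pmatrix} 0 & -E_x & -E_y & -E_z\\ E_x & 0 & -B_z & B_y\\ E_y & B_z & 0 & -B_x\\ E_z & -B_y & B_x & 0 \end{pmatrix} \tag{4.52}$$

and the 4-vector $J^\mu = (c\rho, \mathbf{J})$. The first and fourth of Maxwell's equations listed above can now be compactly
written:

$$\partial_\mu F^{\mu\nu} = \frac{4\pi}{c}J^\nu. \tag{4.53}$$

Since $F^{\mu\nu}$ is an antisymmetric tensor, $F^{\mu\nu} = -F^{\nu\mu}$, one finds that $\partial_\mu J^\mu = 0$. This can be written as

$$\nabla\cdot\mathbf{J} = -\frac{\partial\rho}{\partial t}, \tag{4.54}$$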

which is simply the charge continuity equation. On the other hand, the homogeneous equations (the second and
third of Maxwell's equations listed above) may be reexpressed via the scalar potential $\phi$ and vector potential $\mathbf{A}$ as
follows:

$$\mathbf{B} = \nabla\times\mathbf{A}, \qquad \mathbf{E} = -\nabla\phi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t}, \tag{4.55}$$

which can be compactly expressed through:

$$F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu. \tag{4.56}$$

The fields $\mathbf{E}$ and $\mathbf{B}$ are the physical observables, whereas the potentials $\phi$ and $\mathbf{A}$ [or simply $A^\mu = (\phi, \mathbf{A})$] are not
uniquely determined:

$$A^{\mu\prime} = A^\mu + \partial^\mu\lambda \tag{4.57}$$

gives the same fields since $\partial^\mu A^{\nu\prime} - \partial^\nu A^{\mu\prime} = \partial^\mu A^\nu - \partial^\nu A^\mu$. Here, $\lambda$ is any differentiable function of position and
time. Eq. (4.57) is an example of a gauge transformation, because we can choose any particular gauge ($\lambda$) that
we like. For instance, $\partial_\mu A^\mu = 0$ is the Lorentz condition, after which the remaining gauge freedom is restricted to
functions $\lambda$ that satisfy $\Box\lambda = 0$, where $\Box = \partial^\mu\partial_\mu$. In this gauge, the inhomogeneous Maxwell equations simplify to

$$\Box A^\mu = 4\pi J^\mu/c. \tag{4.58}$$

For empty space with $J^\mu = 0$, we may choose a gauge with $A^0 = 0$ such that the Lorentz condition takes the form
$\nabla\cdot\mathbf{A} = 0$. This special case is known as the Coulomb gauge.

Note that by selecting $A^0 = 0$, we restrict ourselves to one particular inertial system, and thus break so-called
Lorentz covariance (which is the property that the equations are written in a form which is valid in any inertial
system). Alternatively, we must perform a gauge transformation along with every Lorentz transformation in order
to keep the Coulomb gauge intact in a new inertial system.

In QED, $A^\mu$ can be thought of as the "wavefunction" of the photon. For a free photon ($J^\mu = 0$), the equation of
motion for this wavefunction in our chosen gauge takes the simple form: $\Box A^\mu = 0$. Closer inspection reveals that
this is in fact equivalent to the KG equation for a massless field ($m = 0$). It has plane-wave solutions of the type:

$$A^\mu(x) = a\,e^{-ip\cdot x/\hbar}\epsilon^\mu(p), \tag{4.59}$$
where $\epsilon^\mu$ is the polarization vector characterizing the spin of the photon. Inserting this solution into $\Box A^\mu = 0$
[which is valid only under the Lorentz condition gauge], we find $p^\mu p_\mu = 0$. This means that indeed $m = 0$ and
$E = |\mathbf{p}|c$. Now, the Lorentz condition dictates that $p^\mu\epsilon_\mu = 0$. In the Coulomb gauge, we further have that

$$\epsilon^0 = 0, \qquad \boldsymbol{\epsilon}\cdot\mathbf{p} = 0. \tag{4.60}$$

Therefore, the polarization vector $\boldsymbol{\epsilon}$ is perpendicular to $\mathbf{p}$: a free photon is transversely polarized in the Coulomb gauge. For
a given $\mathbf{p}$, there are thus two linearly independent 3-vectors perpendicular to $\mathbf{p}$, for instance $\boldsymbol{\epsilon}_{(1)} = (1, 0, 0)$ and
$\boldsymbol{\epsilon}_{(2)} = (0, 1, 0)$ for $\mathbf{p} \propto (0, 0, 1)$.

It might seem puzzling that there are only two polarization states since the photon is a spin-1 particle ($s = 1$).
Should there not be three available spin states? We might expect this based on how massive particles of spin $s$
behave: in this case, we are used to there being $2s + 1$ different spin orientations. However, this changes for
massless particles of spin $s$. For such particles, there are only 2 different spin orientations regardless of $s$, except
for $s = 0$ in which case there is only one spin orientation. This is related to the fact that the photon has no rest
frame as it moves with the speed of light.

With our gauge choice, we explicitly eliminated the $m_s = 0$ solution. However, the same physics would transpire
if we did not specify the gauge. In that case, longitudinal "ghost" photons decoupled from everything else would
appear (which would thus not be of any physical consequence since they would not interact with anything).


C. Feynman calculus: application to decays and scattering 

In order to develop a quantitative formulation of elementary particle dynamics, such as decay rates $\Gamma$ and scattering
cross sections $\sigma$, we need two main ingredients:

• Evaluate Feynman diagrams to find the amplitude $\mathcal{M}$ for a given process.

• Use Fermi's Golden rule to compute $\Gamma$ or $\sigma$ from $\mathcal{M}$.

Decay rate and scattering cross section. 

The lifetime of a particle implicitly refers to a particle at rest, since otherwise we always have to take into account
time dilation. Even so, we can only compute the average lifetime of particles from a large sample. The decay rate
$\Gamma$ is the probability per unit time that a given particle (a muon, say) will disintegrate. For a large sample $N(t)$, we have

$$dN = -\Gamma N\,dt \;\Rightarrow\; N(t) = N_0e^{-\Gamma t}. \tag{4.61}$$

The mean lifetime is then defined as $\tau = \Gamma^{-1}$. If a particle has several decay routes $\Gamma_i$ (for instance, a $\pi^+$ can
decay to $\mu^+ + \nu_\mu$, $e^+ + \nu_e + \gamma$, etc.), the total decay rate is

$$\Gamma_{\rm tot} = \sum_{i=1}^{n}\Gamma_i. \tag{4.62}$$

The branching ratio is defined as $\Gamma_i/\Gamma_{\rm tot}$.
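
The bookkeeping between partial rates, total rate, mean lifetime and branching ratios can be sketched as follows. The two partial rates are hypothetical numbers chosen only for illustration, and the function names are not taken from any library.

```python
import math


def total_decay_rate(partial_rates):
    """Gamma_tot as the sum of the partial rates Gamma_i."""
    return sum(partial_rates.values())


def branching_ratios(partial_rates):
    """Gamma_i / Gamma_tot for every decay channel."""
    total = total_decay_rate(partial_rates)
    return {channel: rate / total for channel, rate in partial_rates.items()}


def surviving_fraction(total_rate, t):
    """N(t)/N_0 = exp(-Gamma t) for a large sample."""
    return math.exp(-total_rate * t)


# Hypothetical partial decay rates in 1/s (illustrative values only).
rates = {"channel A": 3.0e7, "channel B": 1.0e7}

gamma_tot = total_decay_rate(rates)
tau = 1.0 / gamma_tot  # mean lifetime in seconds
print("mean lifetime:", tau)
print("branching ratios:", branching_ratios(rates))
print("fraction left after one lifetime:", surviving_fraction(gamma_tot, tau))
```

After one mean lifetime a fraction $1/e$ of the sample remains, irrespective of how the total rate is shared between the channels.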

Concerning scattering, it gives quantitative information about how particles interact with each other. Imagine
firing a stream of arrows against a target as an analogue of particles scattering.

• Unlike arrows fired at a physical object, the particles in our case do not simply miss or hit. Instead, their deflection
depends on the distance to the target. In general, the effective scattering cross section therefore satisfies
$\sigma_{\rm eff} \neq A$, where $A$ is the physical area of the object.

• The scattering cross section $\sigma$ also depends on the arrows themselves. Particles scatter differently due to
different interactions, such as electromagnetic or weak ones.

• Finally, there is also a dependence on the final state of the arrows, such as whether the scattering is
elastic or inelastic.

Each process has a specific scattering cross section. It depends on the energy of the particles as well: so-called
resonances in the $\sigma$-$E$ diagram indicate bound states such as short-lived particles.

[Figure: the cross section $\sigma$ plotted against the energy $E$.]

The scattering cross section is a measure of how likely a particle reaction is to occur. In fact, we can measure the
particle scattering cross section $\sigma$ for a given reaction via the rate $W$ at which it occurs. This is because $W \propto \sigma$.
When there are several routes to a final end-product, one sums over them (similarly to the decay rate $\Gamma$):

$$\sigma_{\rm tot} = \sum_{i=1}^{n}\sigma_i, \tag{4.63}$$

where $n$ is the total number of processes leading to the end-product. For a particle scattering off some potential
center, we may envision the situation as follows:

[Figure: a particle incident with impact parameter $b$ is deflected by the potential center through the scattering angle $\theta$.]

Here, $b$ is the impact parameter and $\theta$ is the scattering angle. The reader is assumed to be familiar with the
classical treatment of a scattering cross section and we will not give a detailed treatment of classical scattering
theory; if required, a comprehensive treatment can be found in standard texts on classical mechanics. We denote the
differential scattering cross section as $d\sigma/d\Omega$.

The Golden rule. 

The basic ingredients for calculating decay rates and scattering cross sections quantitatively will be the amplitude
$\mathcal{M}$ of the Feynman diagram for a given process (which contains information about how the interactions give rise
to the process, how they depend on energy and momentum, and so forth) and the available phase-space. More
phase-space indicates a more likely process, in general.


Example 6. Heavy particle decay. When a particle with a large mass decays into several light secondary particles,
a large phase-space is involved. This means that there are many ways to distribute the available energy and causes
(generally, but there are exceptions) heavy particles to decay faster than light particles. In contrast, a process such
as $n \to p + e^- + \bar\nu_e$ has very little extra mass to spare since $m_n$ and $m_p$ are almost equal, and thus very small
phase-space. As a result, the neutron has a very long lifetime (compared to other particles that decay via weak
interactions).

Before we work out a quantitative theory concerning decays and scattering cross sections, we state Fermi's Golden
rule in a qualitative form to promote the physical understanding:

$$\text{Transition rate} = \frac{2\pi}{\hbar}|\mathcal{M}|^2 \times \text{phase-space}.$$

We will now look at two particular cases of interest where we quantify what phase-space means mathematically.

Golden rule for decays.

The decay rate for a process $1 \to 2 + 3 + 4 + \ldots + n$ (which is thus a very general process) is given by

$$d\Gamma = |\mathcal{M}|^2\frac{S}{2\hbar m_1}\left[\left(\frac{c\,d^3\mathbf{p}_2}{(2\pi)^32E_2}\right)\left(\frac{c\,d^3\mathbf{p}_3}{(2\pi)^32E_3}\right)\cdots\left(\frac{c\,d^3\mathbf{p}_n}{(2\pi)^32E_n}\right)\right] \times (2\pi)^4\delta^4(p_1 - p_2 - p_3 - \ldots - p_n).$$

It is assumed here that particle 1 is at rest, so that $p_1 = (m_1c, \mathbf{0})$. In general, $p_i = (E_i/c, \mathbf{p}_i)$ is the 4-momentum
for particle $i$ with mass $m_i$ ($E_i^2 - \mathbf{p}_i^2c^2 = m_i^2c^4$). The detailed derivation of the expression for $d\Gamma$ is beyond the
scope of this textbook, but we shall in fact later outline briefly how it is derived using quantum field theory. The
$\delta$-function enforces conservation of $E$ and $\mathbf{p}$. $S$ is a product of statistical factors: it contains a factor $1/j!$ for each
group of $j$ identical particles in the final state. The expression $d\Gamma$ is to be understood as the differential rate of
decay where the momentum of particle 2 ends up in the range $d^3\mathbf{p}_2$ around $\mathbf{p}_2$, and so forth for the remainder of
the produced particles. The total decay rate is obtained via integration. For instance, with two particles in the final
state:

$$\Gamma = \frac{S}{\hbar m_1}\left(\frac{c}{4\pi}\right)^2\frac{1}{2}\int\frac{|\mathcal{M}|^2}{E_2E_3}\,\delta^4(p_1 - p_2 - p_3)\,d^3\mathbf{p}_2\,d^3\mathbf{p}_3. \tag{4.64}$$

In general, the amplitude will depend on the momenta: $\mathcal{M} = \mathcal{M}(\mathbf{p}_2, \mathbf{p}_3)$.
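

Example 7. Two-body decay. Consider the process $m_1 \to m_2 + m_3$. Assume that $\mathcal{M}$ is a known quantity and
compute $\Gamma$. Since $E_j = \sqrt{m_j^2c^4 + \mathbf{p}_j^2c^2}$, we have

$$\Gamma = \frac{S}{2(4\pi)^2\hbar m_1}\int\frac{|\mathcal{M}|^2\,\delta\left(m_1c - \sqrt{m_2^2c^2 + \mathbf{p}_2^2} - \sqrt{m_3^2c^2 + \mathbf{p}_2^2}\right)}{\sqrt{m_2^2c^2 + \mathbf{p}_2^2}\,\sqrt{m_3^2c^2 + \mathbf{p}_2^2}}\,d^3\mathbf{p}_2. \tag{4.65}$$

We have carried out the integral over $d^3\mathbf{p}_3$ by noting it only gives a contribution for $\mathbf{p}_3 = -\mathbf{p}_2$:

$$\delta^4(p_1 - p_2 - p_3) = \delta(m_1c - E_2/c - E_3/c)\,\delta^3(\mathbf{0} - \mathbf{p}_2 - \mathbf{p}_3). \tag{4.66}$$

Now, $|\mathcal{M}|^2$ only depends on $\mathbf{p}_2$ since we have set $\mathbf{p}_3 = -\mathbf{p}_2$. In fact, it should only depend on $|\mathbf{p}_2|$ since it is
a scalar (only the combination $\mathbf{p}_2\cdot\mathbf{p}_2$ represents a scalar). Introducing spherical coordinates and performing the
angular integration gives:

$$\Gamma = \frac{S}{8\pi\hbar m_1}\int_0^\infty\frac{|\mathcal{M}|^2\,\delta\left(m_1c - \sqrt{m_2^2c^2 + \rho^2} - \sqrt{m_3^2c^2 + \rho^2}\right)}{\sqrt{m_2^2c^2 + \rho^2}\,\sqrt{m_3^2c^2 + \rho^2}}\,\rho^2\,d\rho. \tag{4.67}$$

Here, we defined $\rho = |\mathbf{p}_2|$. To solve this integral, introduce the total energy of the final particles:

$$E = c\left(\sqrt{m_2^2c^2 + \rho^2} + \sqrt{m_3^2c^2 + \rho^2}\right). \tag{4.68}$$

This yields:

$$\Gamma = \frac{S}{8\pi\hbar m_1}\int_{(m_2 + m_3)c^2}^{\infty}|\mathcal{M}|^2\frac{\rho}{E}\,\delta(m_1c - E/c)\,dE. \tag{4.69}$$

Finally, using that $\delta(m_1c - E/c) = c\,\delta(E - m_1c^2)$, we obtain

$$\Gamma = \frac{S|\mathcal{M}|^2\rho_0}{8\pi\hbar m_1^2c}, \tag{4.70}$$

where $\rho_0$ is the value of $\rho$ which gives $E = m_1c^2$. Explicitly, it reads:

$$\rho_0 = \frac{c}{2m_1}\sqrt{m_1^4 + m_2^4 + m_3^4 - 2m_1^2m_2^2 - 2m_1^2m_3^2 - 2m_2^2m_3^2}. \tag{4.71}$$

Also, $|\mathcal{M}|$ is evaluated at the momentum consistent with energy conservation and momentum conservation. Note
that $m_1 > m_2 + m_3$ is required for the $\delta$-function to contribute to the integral. This makes physical sense, since
otherwise particle 1 does not have enough rest energy to produce the masses of particles 2 and 3, let alone provide
them with any kinetic energy.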
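
The outgoing momentum in a two-body decay is easy to evaluate numerically. The following is a minimal sketch in which masses are entered as rest energies (for instance in MeV), so that the function returns the momentum times $c$ in the same unit; the function name and the numerical inputs (approximate pion and muon rest energies) are assumptions made for illustration.

```python
import math


def two_body_momentum(m1: float, m2: float, m3: float) -> float:
    """|p| c of either decay product for 1 -> 2 + 3 with particle 1 at rest.

    Masses are rest energies; raises ValueError if the decay is
    kinematically forbidden (m1 < m2 + m3).
    """
    if m1 < m2 + m3:
        raise ValueError("decay forbidden: m1 must be at least m2 + m3")
    radicand = (m1**4 + m2**4 + m3**4
                - 2 * m1**2 * m2**2 - 2 * m1**2 * m3**2 - 2 * m2**2 * m3**2)
    # Guard against tiny negative values from rounding at threshold.
    return math.sqrt(max(radicand, 0.0)) / (2.0 * m1)


# Charged pion -> muon + (massless) neutrino, approximate rest energies in MeV.
pc = two_body_momentum(139.57, 105.66, 0.0)
print(f"|p|c = {pc:.2f} MeV")

# Energy conservation check: E_2 + E_3 should reproduce m_1 c^2.
print(math.sqrt(105.66**2 + pc**2) + pc)
```

For two massless decay products the function reduces to half the parent rest energy, in line with Example 8.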


Example 8. Decay rate of $\pi^0 \to \gamma + \gamma$. Use the formula derived in the previous example and set $m_2 = m_3 = 0$,
leading to $\rho_0 = m_1c/2$. Moreover, we have to set $S = 1/2! = 1/2$ since we are producing two identical particles.
Note that both of these examples were done without knowing the explicit form of $\mathcal{M}$. This is not possible for
decays into three particles.
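
As a worked check, substituting $m_1 = m_\pi$, $S = 1/2$ and the massless-limit momentum $m_\pi c/2$ into the two-body decay rate of Eq. (4.70) gives the following expression, in which the amplitude is still left unspecified:

```latex
\Gamma(\pi^0 \to \gamma\gamma)
  = \frac{1}{2}\,\frac{|\mathcal{M}|^2}{8\pi\hbar m_\pi^2 c}\,\frac{m_\pi c}{2}
  = \frac{|\mathcal{M}|^2}{32\pi\hbar m_\pi}
```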


Golden rule for scattering 

Suppose that particles 1 and 2 collide and produce a number of particles via the process $1 + 2 \to 3 + 4 + \ldots + n$.
The scattering cross section is then:

$$d\sigma = |\mathcal{M}|^2\frac{\hbar^2S}{4\sqrt{(p_1\cdot p_2)^2 - (m_1m_2c^2)^2}}\left[\left(\frac{c\,d^3\mathbf{p}_3}{(2\pi)^32E_3}\right)\cdots\left(\frac{c\,d^3\mathbf{p}_n}{(2\pi)^32E_n}\right)\right] \times (2\pi)^4\delta^4(p_1 + p_2 - p_3 - \ldots - p_n).$$

Typically, one studies at which angle one of the final particles emerges (say, particle 3): the integration required
to find the cross section for this process is then performed over all the other momenta ($\mathbf{p}_4, \mathbf{p}_5, \ldots, \mathbf{p}_n$) and the
magnitude of $\mathbf{p}_3$. We are then left with the differential scattering cross section $d\sigma/d\Omega$ for scattering particle 3 into
solid angle $d\Omega$.


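Example 9. Scattering $1 + 2 \to 3 + 4$ in the CM frame. Assume that $\mathcal{M}$ is known and calculate $d\sigma/d\Omega$.

In the CM frame, we have $\mathbf{p}_1 = -\mathbf{p}_2$, which leads to $p_1\cdot p_2 = E_1E_2/c^2 + \mathbf{p}_1^2$ and thereby
$\sqrt{(p_1\cdot p_2)^2 - (m_1m_2c^2)^2} = (E_1 + E_2)|\mathbf{p}_1|/c$. Therefore:

$$d\sigma = \left(\frac{\hbar c}{8\pi}\right)^2\frac{S|\mathcal{M}|^2c}{(E_1 + E_2)|\mathbf{p}_1|}\,\frac{d^3\mathbf{p}_3\,d^3\mathbf{p}_4}{E_3E_4}\,\delta[(E_1 + E_2 - E_3 - E_4)/c] \times \delta^3(-\mathbf{p}_3 - \mathbf{p}_4). \tag{4.72}$$

Now, express the energies with $\mathbf{p}_3$ and $\mathbf{p}_4$ and perform $\int d^3\mathbf{p}_4$ so that $\mathbf{p}_4 = -\mathbf{p}_3$ everywhere:

$$d\sigma = \left(\frac{\hbar}{8\pi}\right)^2\frac{S|\mathcal{M}|^2c}{(E_1 + E_2)|\mathbf{p}_1|}\,\frac{\delta\left[(E_1 + E_2)/c - \sqrt{m_3^2c^2 + \mathbf{p}_3^2} - \sqrt{m_4^2c^2 + \mathbf{p}_3^2}\right]}{\sqrt{m_3^2c^2 + \mathbf{p}_3^2}\,\sqrt{m_4^2c^2 + \mathbf{p}_3^2}}\,d^3\mathbf{p}_3. \tag{4.73}$$

This time, $|\mathcal{M}|^2$ can in general depend not only on $|\mathbf{p}_3|$, but also on its direction since it depends on all momenta:
due to the assumption of working in the CM system, it depends on $\mathbf{p}_1$ and $\mathbf{p}_3$, so that the direction may play a
role from terms such as $\mathbf{p}_1\cdot\mathbf{p}_3$. We cannot carry out the integration over $d\Omega$ unless $|\mathcal{M}|^2$ is specified, but we can
integrate over $|\mathbf{p}_3|$:

$$\frac{d\sigma}{d\Omega} = \left(\frac{\hbar}{8\pi}\right)^2\frac{Sc}{(E_1 + E_2)|\mathbf{p}_1|}\int_0^\infty|\mathcal{M}|^2\,\frac{\delta\left[(E_1 + E_2)/c - \sqrt{m_3^2c^2 + \rho^2} - \sqrt{m_4^2c^2 + \rho^2}\right]}{\sqrt{m_3^2c^2 + \rho^2}\,\sqrt{m_4^2c^2 + \rho^2}}\,\rho^2\,d\rho, \tag{4.74}$$

where we introduced $\rho = |\mathbf{p}_3|$. This is the same integral as in our previous example of a decay if we perform the
substitutions $m_2 \to m_4$ and $m_1 \to (E_1 + E_2)/c^2$. The result is:

$$\frac{d\sigma}{d\Omega} = \left(\frac{\hbar c}{8\pi}\right)^2\frac{S|\mathcal{M}|^2}{(E_1 + E_2)^2}\,\frac{|\mathbf{p}_f|}{|\mathbf{p}_i|}. \tag{4.75}$$

Here, $|\mathbf{p}_f|$ is the magnitude of either final momentum (does not matter since we work in the CM frame) and $|\mathbf{p}_i|$
is the incoming momentum. As before, it is implicitly understood that $|\mathcal{M}|^2$ in the final expression is evaluated at
these momenta.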
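
The final CM-frame expression is straightforward to evaluate once $|\mathcal{M}|^2$ is known. The sketch below assumes energies and momenta (times $c$) in MeV and uses $\hbar c \approx 197.327$ MeV fm, so that the result comes out in fm$^2$ per steradian (1 fm$^2$ = 10 mb); the constant amplitude in the usage example is purely hypothetical.

```python
import math

HBAR_C_MEV_FM = 197.327  # hbar * c in MeV fm (approximate)


def dsigma_domega_cm(amp_sq, e1, e2, p_initial, p_final, stat_factor=1.0):
    """Differential cross section for 1 + 2 -> 3 + 4 in the CM frame, in fm^2/sr.

    amp_sq      : |M|^2, dimensionless for a two-to-two process
    e1, e2      : energies of the incoming particles in MeV
    p_initial   : |p_i| c of either incoming particle in MeV
    p_final     : |p_f| c of either outgoing particle in MeV
    stat_factor : statistical factor S (1/j! per group of j identical final particles)
    """
    prefactor = (HBAR_C_MEV_FM / (8.0 * math.pi)) ** 2
    return prefactor * stat_factor * amp_sq / (e1 + e2) ** 2 * p_final / p_initial


# Hypothetical elastic process with a constant (isotropic) amplitude.
value = dsigma_domega_cm(amp_sq=1.0, e1=500.0, e2=500.0, p_initial=480.0, p_final=480.0)
print(value, "fm^2/sr")
print(4.0 * math.pi * value, "fm^2 in total for an isotropic amplitude")
```

For an angle-dependent amplitude, the total cross section would instead require integrating the returned value over the solid angle.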


Let us also say a few words about the dimensions of the quantities we have considered. We have $[\Gamma] = \mathrm{s}^{-1}$ and
$[\sigma] = \mathrm{m}^2$, although in practice the cross section is more commonly given in cm$^2$ or even barn, where 1 barn
$= 10^{-24}$ cm$^2$. The dimension of the Feynman amplitude is $[\mathcal{M}] = (mc)^{4-n}$, in effect momentum raised to the
power $4 - n$ where $n$ is the number of incoming plus outgoing particles.
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How does one derive the formula for dT and dcr? 

We here outline how the formulae we have given for $d\Gamma$ and $d\sigma$ are derived, following the 1988 textbook by Mandl
and Shaw. Both decay and scattering have in common that one starts with an initial state $|i\rangle$ and ends up in a final
state $|f\rangle$. The rate or likelihood of this process is determined by matrix elements of the type
$|\langle f|\Phi(\infty)\rangle|^2$, where $|\Phi(\infty)\rangle$ is the state into which $|i\rangle$ evolves as $t \to \infty$.
It includes all possible states that $|i\rangle$ can evolve into, and the matrix element thus measures the overlap with the
one specific final state $|f\rangle$ that we are interested in. Now, to find $|\Phi(\infty)\rangle$ we use that
$|\Phi(\infty)\rangle = S|i\rangle$, where $S$ is the $S$-matrix determining the time evolution of $|i\rangle$. $S$ may be
obtained via the standard time-dependent QM equation of motion (Schrödinger equation):

$$i\frac{d}{dt}|\Phi(t)\rangle = H_I(t)|\Phi(t)\rangle \qquad (4.76)$$

in an iterative fashion. Here, $H_I(t)$ contains the interactions that govern the decay/scattering process. One finds
that:

$$S = \sum_{n=0}^{\infty} \frac{(-i)^n}{n!} \int_{-\infty}^{\infty} dt_1 \int_{-\infty}^{\infty} dt_2 \cdots \int_{-\infty}^{\infty} dt_n\, T\{H_I(t_1)H_I(t_2)\cdots H_I(t_n)\}, \qquad (4.77)$$

where $T$ is the time-ordering operator. Essentially, we thus need to compute $S_{fi} = \langle f|S|i\rangle$ for the specified
states $|i\rangle$ and $|f\rangle$ for a given interaction $H_I$. When computing this matrix element, one typically ends up with
a result of the type:


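$$S_{fi} = \delta_{fi} + C\,(2\pi)^4\delta^4\Big(\sum p_f - \sum p_i\Big) \prod_{i}\Big(\frac{1}{2VE_i}\Big)^{1/2} \prod_{f}\Big(\frac{1}{2VE_f}\Big)^{1/2} \mathcal{M}, \qquad (4.78)$$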

where $C$ is a constant and $\mathcal{M}$ contains information about the external lines, i.e. the states $|i\rangle$ and
$|f\rangle$, and the relevant interactions $H_I$. After this, we just have to multiply with an appropriate factor for the
number of states available, and we are left with $d\Gamma$ or $d\sigma$.
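To make the iterative structure of Eq. (4.77) concrete, its first few terms can be written out explicitly. This is only a term-by-term expansion of that equation (with $\hbar = 1$, as the prefactor $(-i)^n$ there suggests), not an additional result:

```latex
S = 1 \;-\; i\int_{-\infty}^{\infty} dt_1\, H_I(t_1)
  \;+\; \frac{(-i)^2}{2!}\int_{-\infty}^{\infty} dt_1 \int_{-\infty}^{\infty} dt_2\,
        T\{H_I(t_1)H_I(t_2)\} \;+\; \ldots
```

The $n = 0$ term gives the $\delta_{fi}$ contribution to $S_{fi}$ (nothing happens), while a diagram with $n$ vertices originates from the $n$-th order term.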
Feynman rules for QED

Turning now to QED specifically, let us preface our discussion of the Feynman rules with a summary of the
properties of electron and positron wavefunctions. By electron and positron, we really mean
particle and antiparticle in general, but for concreteness we focus on those particular cases. The wavefunctions for
free electrons and positrons of momentum $p = (E/c, \mathbf{p})$ and $E = \sqrt{m^2c^4 + \mathbf{p}^2c^2}$ have the following form.


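Electrons: $\psi(x) = ae^{-ipx/\hbar}u^{(s)}(p)$, where $s = 1,2$ denotes the two spin states. The spinor satisfies
$(\gamma^\mu p_\mu - mc)u = 0$ and its adjoint $\bar{u} = u^\dagger\gamma^0$ satisfies $\bar{u}(\gamma^\mu p_\mu - mc) = 0$. Moreover,
the spinors are orthogonal, normalized, and complete:

$$\bar{u}^{(1)}u^{(2)} = 0, \quad \bar{u}u = 2mc, \quad \sum_{s=1,2} u^{(s)}\bar{u}^{(s)} = (\gamma^\mu p_\mu + mc). \qquad (4.79)$$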

Positrons: $\psi(x) = ae^{ipx/\hbar}v^{(s)}(p)$, where $s = 1,2$ denotes the two spin states. The spinor satisfies
$(\gamma^\mu p_\mu + mc)v = 0$ and its adjoint $\bar{v} = v^\dagger\gamma^0$ satisfies $\bar{v}(\gamma^\mu p_\mu + mc) = 0$. Moreover,
the spinors are orthogonal, normalized, and complete:

$$\bar{v}^{(1)}v^{(2)} = 0, \quad \bar{v}v = -2mc, \quad \sum_{s=1,2} v^{(s)}\bar{v}^{(s)} = (\gamma^\mu p_\mu - mc). \qquad (4.80)$$

The completeness relations are important as one normally averages over electron and positron spins (since these
are often random in experiments), so that one requires the complete set. Note that all of the above relations are
equally valid for e.g. $\mu^-$ and $\mu^+$, quarks and antiquarks, and so forth (since all are spin-1/2 particles).

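Photons: The wavefunction reads $A_\mu(x) = ae^{-ipx/\hbar}\epsilon_\mu^{(s)}$, with $s = 1,2$ being the two spin polarization
states. The polarization vectors satisfy $\epsilon^\mu p_\mu = 0$ (Lorentz condition) and are orthogonal and normalized:

$$(\epsilon_\mu^{(1)})^*\epsilon^{\mu(2)} = 0, \quad (\epsilon^\mu)^*\epsilon_\mu = 1. \qquad (4.81)$$

In the Coulomb gauge, $\epsilon^0 = 0$ and $\boldsymbol{\epsilon}\cdot\mathbf{p} = 0$, and one has

$$\sum_{s=1,2} (\boldsymbol{\epsilon}^{(s)})_i(\boldsymbol{\epsilon}^{(s)*})_j = \delta_{ij} - \hat{p}_i\hat{p}_j \qquad (4.82)$$

for the 3-vectors. An explicit pair is, as previously mentioned, $\boldsymbol{\epsilon}^{(1)} = (1,0,0)$ and
$\boldsymbol{\epsilon}^{(2)} = (0,1,0)$.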

With the above notation in mind, we now explain how to compute $\mathcal{M}$ for a given Feynman diagram.

1. Notation. Label the incoming and outgoing momenta and spins with $p_1, p_2, \ldots$ and $s_1, s_2, \ldots$ Internal
momenta are labelled $q_n$. Arrows on external lines indicate whether it is an $e^-$ or $e^+$, while arrows on internal
lines are conventionally assigned so that the direction of flow is preserved (one arrow in, one arrow out). External
photons have arrows that point forward.
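2. External lines. Incoming electrons contribute $u$ and outgoing electrons $\bar{u}$; incoming positrons contribute
$\bar{v}$ and outgoing positrons $v$; incoming photons contribute $\epsilon_\mu$ and outgoing photons $\epsilon_\mu^*$.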



3. Vertex factors. Each vertex contributes $ig_e\gamma^\mu$, where $g_e = e\sqrt{4\pi/\hbar c}$ is a dimensionless coupling
constant. For quarks, where $|q| \neq e$, one should use $g = -q\sqrt{4\pi/\hbar c}$.

4. Propagators. Internal lines contribute with

$$\frac{i(\gamma^\mu q_\mu + mc)}{q^2 - m^2c^2} \qquad (4.83)$$

for electrons and positrons, while photons contribute with

$$\frac{-ig_{\mu\nu}}{q^2}. \qquad (4.84)$$


5. Conservation of E and p. For each vertex, write $(2\pi)^4\delta^4(k_1 + k_2 + k_3)$. Use positive $k$ for an arrow into
the vertex and negative $k$ for an arrow out of the vertex, except for external positrons.

6. Integrate over internal momenta. For each internal $q$, write $\int d^4q/(2\pi)^4$.

7. Cancel the $\delta$-function. Remove $(2\pi)^4\delta^4(p_1 + p_2 + \ldots - p_n)$, and what remains is $-i\mathcal{M}$. It is
worth remarking that if one chooses $C = +i$ in Eq. (4.78), one ends up with $+i\mathcal{M}$, which is a commonly used
convention. However, the overall sign does not matter since it is $|\mathcal{M}|^2$ that is used to compute quantities
such as decay rates and scattering cross sections. We shall stick with the convention that one ends up with
$-i\mathcal{M}$ after cancelling the $\delta$-function.

The total amplitude M is then the sum of all the amplitudes for each contributing diagram while taking into 
account: 


Antisymmetrization. Include a relative minus sign between diagrams that differ only by (1) interchanging
two incoming (or outgoing) electrons or positrons, or (2) exchanging an incoming electron with an
outgoing positron or vice versa. This rule incorporates the Pauli principle for fermionic wavefunctions.


Later, we will comment explicitly on how to deal with fermion loops, which are of relevance in so-called vacuum
polarization diagrams. To test our skills in using the Feynman rules, we now work out $\mathcal{M}$ in detail for some
processes.


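Example 10. $e-\mu$ scattering. The lowest-order diagram looks as follows.

[Figure: lowest-order diagram for $e-\mu$ scattering; the outgoing lines are labelled $p_3, s_3$ and $p_4, s_4$.]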
Using the Feynman rules, we obtain the following complex scalar:

$$(2\pi)^4\int[\bar{u}^{(s_3)}(p_3)(ig_e\gamma^\mu)u^{(s_1)}(p_1)]\left(\frac{-ig_{\mu\nu}}{q^2}\right)[\bar{u}^{(s_4)}(p_4)(ig_e\gamma^\nu)u^{(s_2)}(p_2)]\,\delta^4(p_1 - p_3 - q)\,\delta^4(p_2 + q - p_4)\,d^4q. \qquad (4.85)$$

After performing the $q$-integration and dropping the $\delta$-function, we are left with

$$\mathcal{M} = -\frac{g_e^2}{(p_1 - p_3)^2}[\bar{u}^{(s_3)}(p_3)\gamma^\mu u^{(s_1)}(p_1)][\bar{u}^{(s_4)}(p_4)\gamma_\mu u^{(s_2)}(p_2)]. \qquad (4.86)$$

We emphasize again that $\mathcal{M}$ is a number.


Example 11. $e-e$ scattering. The diagram is similar to the above example, but there are now two contributing
diagrams since electrons are identical particles and cannot be distinguished. Therefore, we have to account for
an additional diagram where $(p_3, s_3) \leftrightarrow (p_4, s_4)$, since we cannot tell which electron comes out where.

As a result, the total amplitude is obtained by adding the two contributions with a relative minus sign due to the
aforementioned antisymmetrization rule:

$$\mathcal{M} = -\frac{g_e^2}{(p_1-p_3)^2}[\bar{u}(3)\gamma^\mu u(1)][\bar{u}(4)\gamma_\mu u(2)] + \frac{g_e^2}{(p_1-p_4)^2}[\bar{u}(4)\gamma^\mu u(1)][\bar{u}(3)\gamma_\mu u(2)]. \qquad (4.87)$$

We were able to simply write down this $\mathcal{M}$ by using our result from the $e-\mu$ scattering example. For brevity, we
introduced the notation $u(i) = u^{(s_i)}(p_i)$.
The first contribution [(a)] is evaluated, according to the Feynman rules, to the following complex scalar:

$$(2\pi)^4\int[\bar{u}(3)(ig_e\gamma^\mu)u(1)]\left(\frac{-ig_{\mu\nu}}{q^2}\right)[\bar{v}(2)(ig_e\gamma^\nu)v(4)]\,\delta^4(p_1 - p_3 - q)\,\delta^4(p_2 + q - p_4)\,d^4q. \qquad (4.88)$$

Note how the order is always adjoint spinor/gamma matrix/spinor when moving along a fermion line, in order for
the dimensionality of the factors to match. Moving backwards (along the arrow) on an antiparticle line is equivalent
to moving forward in time. The second contribution [(b)] provides:

$$(2\pi)^4\int[\bar{u}(3)(ig_e\gamma^\mu)v(4)]\left(\frac{-ig_{\mu\nu}}{q^2}\right)[\bar{v}(2)(ig_e\gamma^\nu)u(1)]\,\delta^4(q - p_3 - p_4)\,\delta^4(p_1 + p_2 - q)\,d^4q. \qquad (4.89)$$


By interchanging the incoming $e^+$ and outgoing $e^-$, we obtain a diagram
which is equivalent to the first contributing diagram. Therefore, due to the rule of antisymmetrization, the two
amplitudes must add with a relative minus sign.


D. Casimir’s trick and trace theorems 

If all spins and polarizations are known a priori in the experiment, the appropriate spinors and polarization vectors
are simply inserted into the expression for $\mathcal{M}$. However, it is more often the case that the spins are not known,
but random. Then, the cross section is given by the average over all initial spin configurations and the sum over
all final spin configurations. Thus, $\langle|\mathcal{M}|^2\rangle$ = average over initial spins and sum over final spins. Let
us introduce some convenient notation known as Feynman slash notation, $\slashed{a} = \gamma^\mu a_\mu$, and
$\bar{\Gamma} = \gamma^0\Gamma^\dagger\gamma^0$ for a matrix $\Gamma$. Consider now $e-\mu$ scattering. From our previous result
for $\mathcal{M}$ for this process, we find:

$$|\mathcal{M}|^2 = \frac{g_e^4}{(p_1-p_3)^4}[\bar{u}(3)\gamma^\mu u(1)][\bar{u}(4)\gamma_\mu u(2)][\bar{u}(3)\gamma^\nu u(1)]^*[\bar{u}(4)\gamma_\nu u(2)]^*. \qquad (4.90)$$

This mathematical object is built up from the schematic structure $G = [\bar{u}(a)\Gamma_1u(b)][\bar{u}(a)\Gamma_2u(b)]^*$,
where $\Gamma_{1,2}$ are $4\times4$ matrices. For a scalar ($1\times1$ matrix) the operation $*$ is equivalent to $\dagger$,
which means that

$$[\bar{u}(a)\Gamma_2u(b)]^* = [u^\dagger(a)\gamma^0\Gamma_2u(b)]^\dagger = u^\dagger(b)\Gamma_2^\dagger\gamma^{0\dagger}u(a) = \bar{u}(b)\bar{\Gamma}_2u(a), \qquad (4.91)$$

where we used that $\gamma^{0\dagger} = \gamma^0$ and $(\gamma^0)^2 = 1$. We now perform a summation over the spin orientations
of particle $b$:

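$$\sum_{b\ \text{spins}} G = \bar{u}(a)\Gamma_1\Big\{\sum_{s_b=1,2}u^{(s_b)}(p_b)\bar{u}^{(s_b)}(p_b)\Big\}\bar{\Gamma}_2u(a) = \bar{u}(a)\Gamma_1(\slashed{p}_b + m_bc)\bar{\Gamma}_2u(a) = \bar{u}(a)Qu(a), \qquad (4.92)$$

where $Q = \Gamma_1(\slashed{p}_b + m_bc)\bar{\Gamma}_2$. Similarly, for a summation over the $a$ spins we get:

$$\sum_{a\ \text{spins}}\sum_{b\ \text{spins}} G = Q_{ij}\Big\{\sum_{s_a=1,2}u^{(s_a)}(p_a)\bar{u}^{(s_a)}(p_a)\Big\}_{ji} = Q_{ij}(\slashed{p}_a + m_ac)_{ji} = \text{Tr}\{Q(\slashed{p}_a + m_ac)\}. \qquad (4.93)$$

Here, $\text{Tr}(A) = \sum_i A_{ii}$. We also used that $u$ and $\bar{u}$ are $4\times1$ and $1\times4$ spinors, respectively. In
total, we thus have

$$\sum_{\text{all spins}}[\bar{u}(a)\Gamma_1u(b)][\bar{u}(a)\Gamma_2u(b)]^* = \text{Tr}\{\Gamma_1(\slashed{p}_b + m_bc)\bar{\Gamma}_2(\slashed{p}_a + m_ac)\}. \qquad (4.94)$$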

We see that there are no spinors left after the summation over spins: only matrix multiplication and a trace at the 
end. This procedure is known as Casimir’s trick. Note that if u is replaced with v, the corresponding mass should 
change sign. 
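Casimir's trick in Eq. (4.94) can be checked numerically by summing over explicit spinors and comparing with the trace. The sketch below does this with the standard library only. The Dirac representation of the $\gamma$-matrices, the explicit form of the spinors, the choice $\Gamma_1 = \gamma^0$, $\Gamma_2 = \gamma^1$, the units $c = 1$, and the sample momenta and masses are all assumptions made for this illustration.

```python
import math

# Gamma matrices in the Dirac representation (assumed convention).
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G1 = [[0, 0, 0, 1], [0, 0, 1, 0], [0, -1, 0, 0], [-1, 0, 0, 0]]
G2 = [[0, 0, 0, -1j], [0, 0, 1j, 0], [0, 1j, 0, 0], [-1j, 0, 0, 0]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
GAMMA = [G0, G1, G2, G3]
METRIC = [1, -1, -1, -1]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(4)] for i in range(4)]


def trace(a):
    return sum(a[i][i] for i in range(4))


def slash_plus_mass(p, m):
    """gamma^mu p_mu + m for p = (E, px, py, pz), in units with c = 1."""
    return [[sum(METRIC[mu] * p[mu] * GAMMA[mu][i][j] for mu in range(4)) + (m if i == j else 0)
             for j in range(4)] for i in range(4)]


def spinors(p3, m):
    """Four-momentum and both spinors u^(1), u^(2), normalised to ubar u = 2m."""
    px, py, pz = p3
    e = math.sqrt(m * m + px * px + py * py + pz * pz)
    n = math.sqrt(e + m)
    u1 = [n, 0, n * pz / (e + m), n * (px + 1j * py) / (e + m)]
    u2 = [0, n, n * (px - 1j * py) / (e + m), -n * pz / (e + m)]
    return (e, px, py, pz), [u1, u2]


def sandwich(ua, mat, ub):
    """ubar(a) mat u(b), with ubar = u^dagger gamma^0."""
    ubar = [sum(complex(ua[k]).conjugate() * G0[k][j] for k in range(4)) for j in range(4)]
    return sum(ubar[i] * mat[i][j] * ub[j] for i in range(4) for j in range(4))


m_a, m_b = 1.0, 2.0
p_a, u_a = spinors((0.3, -0.2, 0.5), m_a)
p_b, u_b = spinors((-0.1, 0.4, 0.2), m_b)
gam1, gam2 = G0, G1
gam2_bar = matmul(G0, matmul(dagger(gam2), G0))

# Left-hand side of Eq. (4.94): explicit sum over all four spin combinations.
spin_sum = sum(sandwich(a, gam1, b) * sandwich(a, gam2, b).conjugate() for a in u_a for b in u_b)
# Right-hand side: a single trace, no spinors left.
trace_form = trace(matmul(matmul(gam1, slash_plus_mass(p_b, m_b)),
                          matmul(gam2_bar, slash_plus_mass(p_a, m_a))))
print(spin_sum, trace_form)
```

The two printed numbers should agree to rounding error, which illustrates why the spin sum never has to be carried out spinor by spinor.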


For $e-\mu$ scattering, we then have $\Gamma_1 = \gamma^\mu$ and $\bar{\Gamma}_2 = \gamma^0\gamma^{\nu\dagger}\gamma^0 = \gamma^\nu$. We find:

$$\langle|\mathcal{M}|^2\rangle = \frac{1}{4}\frac{g_e^4}{(p_1-p_3)^4}\text{Tr}\{\gamma^\mu(\slashed{p}_1 + mc)\gamma^\nu(\slashed{p}_3 + mc)\}\,\text{Tr}\{\gamma_\mu(\slashed{p}_2 + Mc)\gamma_\nu(\slashed{p}_4 + Mc)\}, \qquad (4.95)$$

where $m = m_e$, $M = m_\mu$, and the factor $\frac{1}{4}$ is included to average over initial spins (2 particles with 2
configurations each = 4 configurations). To evaluate the traces appearing in $\langle|\mathcal{M}|^2\rangle$, there is a set of
trace theorems that may be derived by using the fundamental mathematical properties of the Tr operation. Some useful
relations, which we will draw upon in later calculations, are:

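• $\text{Tr}\{A + B\} = \text{Tr}\{A\} + \text{Tr}\{B\}$.

• $\text{Tr}\{AB\} = \text{Tr}\{BA\}$.

• $\text{Tr}\{aA\} = a\,\text{Tr}\{A\}$.

• $g_{\mu\nu}g^{\mu\nu} = 4$.

• $\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}$.

• $\slashed{a}\slashed{b} + \slashed{b}\slashed{a} = 2a\cdot b$.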


From this follows, for instance, that $\text{Tr}\{\gamma^\mu\gamma^\nu\} = 4g^{\mu\nu}$ and $\text{Tr}\{\slashed{a}\slashed{b}\} = 4a\cdot b$. Also, the trace over an odd number of
$\gamma$-matrices is equal to zero (try to prove this yourself, by using that $\text{Tr}\{\gamma^5\} = 0$ and $\{\gamma^5, \gamma^\mu\} = 0$).
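The trace theorems listed above are easy to confirm with explicit matrices. The following sketch uses the Dirac representation (an assumed but standard choice; the theorems themselves are representation independent) and checks the anticommutator, the two-$\gamma$ trace, and the vanishing of traces over one and three $\gamma$-matrices. The helper names are hypothetical.

```python
from itertools import product

# Gamma matrices in the Dirac representation (assumed convention).
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G1 = [[0, 0, 0, 1], [0, 0, 1, 0], [0, -1, 0, 0], [-1, 0, 0, 0]]
G2 = [[0, 0, 0, -1j], [0, 0, 1j, 0], [0, 1j, 0, 0], [-1j, 0, 0, 0]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
GAMMA = [G0, G1, G2, G3]
METRIC = [1, -1, -1, -1]   # diagonal of g^{mu nu}


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def trace(a):
    return sum(a[i][i] for i in range(4))


def metric(mu, nu):
    return METRIC[mu] if mu == nu else 0


def anticommutator_ok():
    """Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu} (times the unit matrix)."""
    for mu, nu in product(range(4), repeat=2):
        ab, ba = matmul(GAMMA[mu], GAMMA[nu]), matmul(GAMMA[nu], GAMMA[mu])
        for i, j in product(range(4), repeat=2):
            expected = 2 * metric(mu, nu) if i == j else 0
            if abs(ab[i][j] + ba[i][j] - expected) > 1e-12:
                return False
    return True


def two_gamma_trace_ok():
    """Check Tr{gamma^mu gamma^nu} = 4 g^{mu nu}."""
    return all(abs(trace(matmul(GAMMA[mu], GAMMA[nu])) - 4 * metric(mu, nu)) < 1e-12
               for mu, nu in product(range(4), repeat=2))


def largest_odd_trace():
    """Largest |Tr| over all products of one or three gamma matrices."""
    singles = [abs(trace(g)) for g in GAMMA]
    triples = [abs(trace(matmul(matmul(GAMMA[a], GAMMA[b]), GAMMA[c])))
               for a, b, c in product(range(4), repeat=3)]
    return max(singles + triples)


print(anticommutator_ok(), two_gamma_trace_ok(), largest_odd_trace())
```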


E. Treatment of bubble diagrams 

In some Feynman diagrams, fermion loops (so-called bubbles) appear. The general rule for treating these is: 

Include a factor —1, take the Tr, and follow the fermion lines along the arrows. 

Taking the Tr physically corresponds to including all possible spin orientations of the fermions as it connects with 
itself in the end, in the same spirit as the integration over all internal momenta. 
Example 13. Evaluating a fermion bubble. Consider the following contribution to a Feynman diagram.

[Figure: fermion loop with an internal line labelled $k - q$.]

Using the rule stated above, we obtain

$$-\text{Tr}\int\frac{d^4k}{(2\pi)^4}\frac{\gamma^\mu(\slashed{k} + mc)\gamma^\nu(\slashed{k} - \slashed{q} + mc)}{[(k - q)^2 - m^2c^2][k^2 - m^2c^2]}. \qquad (4.96)$$


However, is the direction in which we follow the arrows important? Could we instead follow the arrows in the
opposite direction? The answer is that for a loop with two (in general, an even number of) fermions, we can. The
proof goes as follows. Consider the trace $\text{Tr}\{\gamma^\mu\slashed{p}\gamma^\nu\slashed{q}\}$ obtained by traversing a fermion
loop in a given direction. This can be written as:

$$\text{Tr}\{\gamma^\mu\slashed{p}\gamma^\nu\slashed{q}\} = 4p_\lambda q_\sigma(g^{\mu\lambda}g^{\nu\sigma} - g^{\mu\nu}g^{\lambda\sigma} + g^{\mu\sigma}g^{\lambda\nu}) = 4(p^\mu q^\nu - g^{\mu\nu}p^\lambda q_\lambda + q^\mu p^\nu). \qquad (4.97)$$

If we instead had traversed the fermion loop in the opposite direction, we would have obtained
$\text{Tr}\{\gamma^\mu\slashed{q}\gamma^\nu\slashed{p}\}$. But Eq. (4.97) is invariant upon exchanging $p \leftrightarrow q$. Therefore,
the two traces must be identical and the direction in which we move around the loop does not matter. This proof can
then be generalized to any even number of fermions, where one makes use of the general formula
$\text{Tr}\{\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_n}\} = \text{Tr}\{\gamma^{\mu_n}\cdots\gamma^{\mu_2}\gamma^{\mu_1}\}$.
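The statement that the loop can be traversed in either direction can be tested numerically for the two-fermion case. The sketch below evaluates $\text{Tr}\{\gamma^\mu\slashed{p}\gamma^\nu\slashed{q}\}$ for all $\mu, \nu$, compares it with the closed form of Eq. (4.97), and also with the trace where $p$ and $q$ are exchanged. The Dirac representation and the sample 4-vectors are assumptions made for illustration.

```python
from itertools import product

# Gamma matrices in the Dirac representation (assumed convention).
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G1 = [[0, 0, 0, 1], [0, 0, 1, 0], [0, -1, 0, 0], [-1, 0, 0, 0]]
G2 = [[0, 0, 0, -1j], [0, 0, 1j, 0], [0, 1j, 0, 0], [-1j, 0, 0, 0]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
GAMMA = [G0, G1, G2, G3]
METRIC = [1, -1, -1, -1]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def trace(a):
    return sum(a[i][i] for i in range(4))


def slash(p):
    """gamma^mu p_mu for contravariant components p = (p^0, p^1, p^2, p^3)."""
    return [[sum(METRIC[m] * p[m] * GAMMA[m][i][j] for m in range(4)) for j in range(4)]
            for i in range(4)]


def loop_trace(mu, a, nu, b):
    """Tr{gamma^mu a-slash gamma^nu b-slash}."""
    return trace(matmul(matmul(GAMMA[mu], slash(a)), matmul(GAMMA[nu], slash(b))))


p = (1.3, 0.2, -0.7, 0.4)    # arbitrary sample 4-vectors
q = (0.9, -0.5, 0.1, 0.6)
p_dot_q = sum(METRIC[m] * p[m] * q[m] for m in range(4))

max_dev_closed_form = 0.0
max_dev_reversed = 0.0
for mu, nu in product(range(4), repeat=2):
    g = METRIC[mu] if mu == nu else 0
    closed = 4 * (p[mu] * q[nu] - g * p_dot_q + q[mu] * p[nu])
    forward = loop_trace(mu, p, nu, q)
    backward = loop_trace(mu, q, nu, p)
    max_dev_closed_form = max(max_dev_closed_form, abs(forward - closed))
    max_dev_reversed = max(max_dev_reversed, abs(forward - backward))

print(max_dev_closed_form, max_dev_reversed)
```

Both deviations should vanish to rounding error, in line with the symmetry of Eq. (4.97) under $p \leftrightarrow q$.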
F. Computation of various cross sections and lifetimes

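We are now in a position to explicitly evaluate $d\sigma/d\Omega$ for e.g. Mott and Rutherford scattering. If an electron of
mass $m_e$ scatters off a much heavier muon of mass $M \gg m_e$, the task is to find $d\sigma/d\Omega$ in the lab frame, where
$M$ is at rest, when neglecting the recoil of $M$. Using the rules relating the cross section to $\mathcal{M}$ given
previously, one finds

$$\frac{d\sigma}{d\Omega} = \left(\frac{\hbar}{8\pi Mc}\right)^2\langle|\mathcal{M}|^2\rangle. \qquad (4.98)$$

[Figure: the scattering before and after, in the lab frame; the incoming electron has energy $E$ and momentum $\mathbf{p}_1$.]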

We now want to compute $\mathcal{M}$ explicitly. We have

$$p_1 = (E/c, \mathbf{p}_1), \quad p_2 = (Mc, \mathbf{0}), \quad p_3 = (E/c, \mathbf{p}_3), \quad p_4 = (Mc, \mathbf{0}), \qquad (4.99)$$

where $|\mathbf{p}_1| = |\mathbf{p}_3|$ due to our assumptions. The averaged amplitude is given by (using Casimir’s trick):


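$$\langle|\mathcal{M}|^2\rangle = \frac{8g_e^4}{(p_1 - p_3)^4}\left[(p_1\cdot p_2)(p_3\cdot p_4) + (p_1\cdot p_4)(p_2\cdot p_3) - (p_1\cdot p_3)M^2c^2 - (p_2\cdot p_4)m^2c^2 + 2(mMc^2)^2\right] \qquad (4.100)$$

by evaluating the traces that arise. Inserting the above 4-momenta gives, with $p = |\mathbf{p}_1| = |\mathbf{p}_3|$ and $\theta$ the
scattering angle:

$$\langle|\mathcal{M}|^2\rangle = \left(\frac{g_e^2Mc}{p^2\sin^2(\theta/2)}\right)^2[(mc)^2 + p^2\cos^2(\theta/2)]. \qquad (4.101)$$

Inserted into our expression for the differential scattering cross section, we obtain the Mott formula:

$$\left(\frac{d\sigma}{d\Omega}\right)_{\text{lab}} = \left(\frac{\alpha\hbar}{2p^2\sin^2(\theta/2)}\right)^2[(mc)^2 + p^2\cos^2(\theta/2)]. \qquad (4.102)$$

In the non-relativistic limit $p^2 \ll (mc)^2$, one arrives at the Rutherford formula:

$$\frac{d\sigma}{d\Omega} = \left(\frac{e^2}{2mv^2\sin^2(\theta/2)}\right)^2. \qquad (4.103)$$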
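The relation between the Mott formula (4.102) and its Rutherford limit (4.103) can be made concrete with a few lines of code. This is a sketch under stated assumptions: energies are in MeV, lengths in fm, $e^2$ is written as $\alpha\hbar c$, and the non-relativistic relation $p = mv$ is used in the Rutherford expression. The function names and sample momenta are hypothetical.

```python
import math

ALPHA = 1.0 / 137.035999      # fine-structure constant
HBARC_MEV_FM = 197.3269804    # hbar*c in MeV*fm
M_E_C2 = 0.51099895           # electron rest energy in MeV


def mott(p_c, theta, mass_c2=M_E_C2):
    """Eq. (4.102) in fm^2/sr, with p_c = |p| c and mass_c2 = m c^2 in MeV."""
    s2 = math.sin(theta / 2.0) ** 2
    c2 = math.cos(theta / 2.0) ** 2
    return (ALPHA * HBARC_MEV_FM / (2.0 * p_c ** 2 * s2)) ** 2 * (mass_c2 ** 2 + p_c ** 2 * c2)


def rutherford(p_c, theta, mass_c2=M_E_C2):
    """Eq. (4.103) in fm^2/sr, using e^2 = alpha*hbar*c and v/c = p c / (m c^2)."""
    v_over_c = p_c / mass_c2
    return (ALPHA * HBARC_MEV_FM / (2.0 * mass_c2 * v_over_c ** 2 * math.sin(theta / 2.0) ** 2)) ** 2


theta = math.radians(60.0)
ratio_slow = mott(1.0e-3, theta) / rutherford(1.0e-3, theta)   # p c = 1 keV
ratio_fast = mott(1.0, theta) / rutherford(1.0, theta)         # p c = 1 MeV
print(ratio_slow, ratio_fast)
```

The ratio equals $1 + (p/mc)^2\cos^2(\theta/2)$, so it approaches 1 for slow electrons and grows once $p$ becomes comparable to $mc$.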


Whereas the above illustrates an example of how the differential scattering cross section may be evaluated fully,
including the calculation of $\mathcal{M}$, we should also consider an example of a decay. But a moment of reflection
reveals an interesting fact: there are no decays in pure QED. There is no mechanism that can convert e.g. $\mu^-$ to
$e^-$ since fermion lines cannot simply terminate in a diagram. If we allow for neutrino oscillations, such a process
can become possible, but then we have introduced a type of interaction that is not contained in pure QED. On the
other hand, annihilation events such as $e^- + e^+ \to \gamma + \gamma$ are fully possible in pure QED, but this is conventionally
viewed as a scattering event rather than a decay.

As an example, let us in fact analyze this annihilation in the positronium rest frame (the CM frame for the $e^-e^+$
pair). We assume that the bound state is in the singlet spin configuration and at rest to begin with. There are two
contributing diagrams, as seen in the figure.
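The corresponding amplitudes are

$$\mathcal{M}_1 = \frac{g_e^2}{(p_1 - p_3)^2 - m^2c^2}\bar{v}(2)\slashed{\epsilon}_4(\slashed{p}_1 - \slashed{p}_3 + mc)\slashed{\epsilon}_3u(1), \quad \mathcal{M}_2 = \frac{g_e^2}{(p_1 - p_4)^2 - m^2c^2}\bar{v}(2)\slashed{\epsilon}_3(\slashed{p}_1 - \slashed{p}_4 + mc)\slashed{\epsilon}_4u(1). \qquad (4.104)$$

We have omitted the complex conjugation sign on the polarization vectors for brevity of notation; however, we
will reinstate this in the final result. The total amplitude reads $\mathcal{M} = \mathcal{M}_1 + \mathcal{M}_2$ and according to
our statements about the frame and starting point we have the following 4-vectors:

$$p_1 = mc(1,0,0,0), \quad p_2 = mc(1,0,0,0), \quad p_3 = mc(1,0,0,1), \quad p_4 = mc(1,0,0,-1). \qquad (4.105)$$

It follows that $(p_1 - p_3)^2 - m^2c^2 = (p_1 - p_4)^2 - m^2c^2 = -2(mc)^2$. Now use the rule

$$\slashed{p}_1\slashed{\epsilon}_3 + \slashed{\epsilon}_3\slashed{p}_1 = 2p_1\cdot\epsilon_3 \qquad (4.106)$$

and invoke the Coulomb gauge $\epsilon^0 = 0$. Since $\mathbf{p}_1 = 0$, this gives $\slashed{p}_1\slashed{\epsilon}_3 = -\slashed{\epsilon}_3\slashed{p}_1$.
Moreover, $\slashed{p}_3\slashed{\epsilon}_3 = -\slashed{\epsilon}_3\slashed{p}_3$ since $p_3\cdot\epsilon_3 = 0$ due to the Lorentz condition. Finally,
using that $(\slashed{p}_1 - mc)u(1) = 0$ [since $u(1)$ solves the Dirac equation] leads to

$$(\slashed{p}_1 - \slashed{p}_3 + mc)\slashed{\epsilon}_3u(1) = \slashed{\epsilon}_3\slashed{p}_3u(1). \qquad (4.107)$$

Inserting all of this into $\mathcal{M}$ produces

$$\mathcal{M} = -\frac{g_e^2}{2(mc)^2}\bar{v}(2)[\slashed{\epsilon}_4\slashed{\epsilon}_3\slashed{p}_3 + \slashed{\epsilon}_3\slashed{\epsilon}_4\slashed{p}_4]u(1). \qquad (4.108)$$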

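Using the properties of the $\gamma$-matrices and the contraction rules, we find that the above can be rewritten as

$$\mathcal{M} = \frac{g_e^2}{mc}\bar{v}(2)[\boldsymbol{\epsilon}_3\cdot\boldsymbol{\epsilon}_4\gamma^0 + i(\boldsymbol{\epsilon}_3\times\boldsymbol{\epsilon}_4)\cdot\boldsymbol{\Sigma}\gamma^3]u(1) \qquad (4.109)$$

where we introduced

$$\boldsymbol{\Sigma} = \begin{pmatrix}\boldsymbol{\sigma} & 0\\ 0 & \boldsymbol{\sigma}\end{pmatrix}. \qquad (4.110)$$

Now, take into account the spin-singlet symmetry so that the final amplitude in the singlet state reads:

$$\mathcal{M}_{\text{singlet}} = (\mathcal{M}_{\uparrow\downarrow} - \mathcal{M}_{\downarrow\uparrow})/\sqrt{2}. \qquad (4.111)$$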

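We have obtained an expression for $\mathcal{M}$ in Eq. (4.109), so to evaluate $\mathcal{M}_{\uparrow\downarrow}$ we simply have to
use a spin-up spinor for the electron and a spin-down spinor for the positron:

$$u(1) = \sqrt{2mc}\begin{pmatrix}1\\0\\0\\0\end{pmatrix}, \quad \bar{v}(2) = \sqrt{2mc}\,[0,0,1,0]. \qquad (4.112)$$

This gives $\bar{v}(2)\gamma^0u(1) = 0$ and $\bar{v}(2)\boldsymbol{\Sigma}\gamma^3u(1) = -2mc\,\hat{z}$, so that

$$\mathcal{M}_{\uparrow\downarrow} = -2ig_e^2(\boldsymbol{\epsilon}_3\times\boldsymbol{\epsilon}_4)_z. \qquad (4.113)$$

The same procedure for the opposite spin configuration yields

$$\mathcal{M}_{\downarrow\uparrow} = 2ig_e^2(\boldsymbol{\epsilon}_3\times\boldsymbol{\epsilon}_4)_z = -\mathcal{M}_{\uparrow\downarrow}. \qquad (4.114)$$

This means that the total amplitude, according to Eq. (4.111), is:

$$\mathcal{M}_{\text{singlet}} = -2\sqrt{2}\,ig_e^2(\boldsymbol{\epsilon}_3\times\boldsymbol{\epsilon}_4)_z \qquad (4.115)$$

for annihilation of a stationary $e^-e^+$ pair in a spin-singlet configuration into two photons which emerge in the
directions $\pm\hat{z}$.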

Note that the amplitude for this process in a triplet configuration $(\uparrow\downarrow + \downarrow\uparrow)/\sqrt{2}$ gives 0, since
$\mathcal{M}_{\uparrow\downarrow} = -\mathcal{M}_{\downarrow\uparrow}$. The reason is conservation of charge conjugation in EM processes,
which only allows $2\gamma$, and not $3\gamma$, from $e^-e^+$ in the singlet configuration. On the other hand, in a triplet
configuration the $3\gamma$ production is allowed from a charge conjugation point of view.
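The spinor sandwiches behind Eqs. (4.113) to (4.115) are small enough to evaluate directly. The sketch below works in units with $mc = 1$ and $g_e = 1$ and computes the coefficient multiplying $(\boldsymbol{\epsilon}_3\times\boldsymbol{\epsilon}_4)_z$. The Dirac representation of the matrices is an assumption, and so are the spinors for the opposite spin configuration, which are not written out in the text and are chosen here following the same sign convention as Eq. (4.112).

```python
import math

# Dirac-representation matrices (assumed convention); only gamma^0, gamma^3 and Sigma_z are needed.
G0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
G3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
SIGMA_Z = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0], [0, 0, 0, -1]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def row_mat_col(row, mat, col):
    """Evaluate the number row . mat . col."""
    return sum(row[i] * mat[i][j] * col[j] for i in range(4) for j in range(4))


MC = 1.0                       # units with m c = 1
NORM = math.sqrt(2.0 * MC)

# Eq. (4.112): electron spin up, positron spin down.
u_up = [NORM, 0, 0, 0]
vbar_down = [0, 0, NORM, 0]
# Opposite configuration (assumed convention, not spelled out in the text).
u_down = [0, NORM, 0, 0]
vbar_up = [0, 0, 0, -NORM]

sigma_z_gamma3 = matmul(SIGMA_Z, G3)

# Coefficient of (eps_3 x eps_4)_z in Eq. (4.109), with g_e = 1.
coeff_up_down = 1j * row_mat_col(vbar_down, sigma_z_gamma3, u_up) / MC
coeff_down_up = 1j * row_mat_col(vbar_up, sigma_z_gamma3, u_down) / MC
gamma0_term = row_mat_col(vbar_down, G0, u_up)

singlet = (coeff_up_down - coeff_down_up) / math.sqrt(2.0)
triplet = (coeff_up_down + coeff_down_up) / math.sqrt(2.0)
print(coeff_up_down, coeff_down_up, singlet, triplet)
```

The output reproduces the coefficients $-2i$, $+2i$ and $-2\sqrt{2}\,i$, and shows the vanishing triplet combination.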
G. Renormalization

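We now consider specifically higher-order diagrams which require renormalization. Consider $e-\mu$ scattering.

[Figure: lowest-order diagram for $e-\mu$ scattering.]

With $q = p_1 - p_3$, the amplitude for this diagram is:

$$\mathcal{M} = -g_e^2[\bar{u}(p_3)\gamma^\mu u(p_1)]\frac{g_{\mu\nu}}{q^2}[\bar{u}(p_4)\gamma^\nu u(p_2)]. \qquad (4.116)$$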


A higher-order correction (4th order, to be specific) to this process is the vacuum-polarization diagram shown 
below. 
The origin of this name is that the temporary $e^-e^+$ pair modifies the effective electric charge of the electron (or
alternatively put, the effective coupling constant of the electron). One may derive the amplitude by using the extra
Feynman rule for fermion loops that we treated earlier: for each closed fermion loop, include an overall factor $-1$
and take the trace. This gives:

$$\mathcal{M} = -\frac{ig_e^4}{q^4}[\bar{u}(p_3)\gamma^\mu u(p_1)]\left\{\int\frac{d^4k}{(2\pi)^4}\frac{\text{Tr}\{\gamma_\mu(\slashed{k} + mc)\gamma_\nu(\slashed{k} - \slashed{q} + mc)\}}{(k^2 - m^2c^2)[(k - q)^2 - m^2c^2]}\right\}[\bar{u}(p_4)\gamma^\nu u(p_2)]. \qquad (4.117)$$


Including this contribution, one observes that the higher-order diagram can in fact be incorporated as an effective
modification of the photon propagator:

$$\frac{g_{\mu\nu}}{q^2} \to \frac{g_{\mu\nu}}{q^2} - \frac{i}{q^4}I_{\mu\nu}, \quad I_{\mu\nu} = -g_e^2\int\frac{d^4k}{(2\pi)^4}\frac{\text{Tr}\{\gamma_\mu(\slashed{k} + mc)\gamma_\nu(\slashed{k} - \slashed{q} + mc)\}}{(k^2 - m^2c^2)[(k - q)^2 - m^2c^2]}. \qquad (4.118)$$

The problem is that the integral $I_{\mu\nu}$ is logarithmically divergent upon evaluation. Our strategy for dealing with this
problem is representative of the main idea behind renormalization: we will start by absorbing this "infinity"
into a new mass and coupling constant. This might seem horrendous from a mathematical point of view, but let
us see how it works out in the end. The general form of $I_{\mu\nu}$ after integration (only $q$ remaining as a 4-vector) is
$g_{\mu\nu}(\ldots) + q_\mu q_\nu(\ldots)$. We thus write $I_{\mu\nu} = -ig_{\mu\nu}q^2I(q^2) + q_\mu q_\nu J(q^2)$. The second term actually
makes no contribution to the final amplitude, which can be verified as follows: $q_\mu$ contracts with $\gamma^\mu$ in the
expression for $\mathcal{M}$ and thus gives

$$[\bar{u}(p_3)\slashed{q}u(p_1)] = \bar{u}(p_3)(\slashed{p}_1 - \slashed{p}_3)u(p_1) = 0 \qquad (4.119)$$

as seen from the basic Dirac equations for $u$ and $\bar{u}$. The first term, on the other hand, may be rewritten in the
following manner (the calculation is beyond the level of this book):

$$I(q^2) = \frac{g_e^2}{12\pi^2}\left\{\int_{m^2}^{\infty}\frac{dx}{x} - 6\int_0^1 z(1 - z)\ln\left(1 - \frac{q^2}{m^2c^2}z(1 - z)\right)dz\right\}. \qquad (4.120)$$

We thus manage to contain the logarithmic divergence in the first term of this quantity. If we temporarily introduce
a cut-off $\Lambda$ instead of the infinite limit of the integral, and define

$$f(x) = 6\int_0^1 z(1 - z)\ln[1 + xz(1 - z)]\,dz \qquad (4.121)$$

with $x = -q^2/(m^2c^2)$, we can evaluate the integral determining $I(q^2)$ in Eq. (4.120) as:

$$I(q^2) = \frac{g_e^2}{12\pi^2}\ln\left(\frac{\Lambda^2}{m^2}\right) - \frac{g_e^2}{12\pi^2}f\left(\frac{-q^2}{m^2c^2}\right). \qquad (4.122)$$

Two particular limiting cases of $f(x)$ allow for a simpler expression:

$$\lim_{x\ll1}f(x) \simeq x/5, \quad \lim_{x\gg1}f(x) \simeq \ln(x). \qquad (4.123)$$

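We are now ready to write down the total amplitude for $e-\mu$ scattering, including the fourth-order
vacuum-polarization diagram as a correction, as follows:

$$\mathcal{M} = -g_e^2[\bar{u}(p_3)\gamma^\mu u(p_1)]\frac{g_{\mu\nu}}{q^2}\left\{1 - \frac{g_e^2}{12\pi^2}\left[\ln\left(\frac{\Lambda^2}{m^2}\right) - f\left(\frac{-q^2}{m^2c^2}\right)\right]\right\}[\bar{u}(p_4)\gamma^\nu u(p_2)]. \qquad (4.124)$$

Here is the critical step: let us now introduce the renormalized coupling constant

$$g_R = g_e\sqrt{1 - \frac{g_e^2}{12\pi^2}\ln(\Lambda^2/m^2)}, \qquad (4.125)$$

so that up to $O(g^2)$ [we make the approximation $g_e^2 \simeq g_R^2$ for the last term] we obtain

$$\mathcal{M} = -g_R^2[\bar{u}(p_3)\gamma^\mu u(p_1)]\frac{g_{\mu\nu}}{q^2}\left[1 + \frac{g_R^2}{12\pi^2}f\left(\frac{-q^2}{m^2c^2}\right)\right][\bar{u}(p_4)\gamma^\nu u(p_2)]. \qquad (4.126)$$

Remarkably, this has the same form as the lowest-order amplitude except for two differences: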
1. We have a new effective coupling constant, $g_e \to g_R$. It depends on the cut-off $\Lambda$, which is physically a
spurious effect since we have introduced $\Lambda$ by hand as a way to avoid a divergence. Nevertheless, this is
not a problem in practice because $g_R$ can be experimentally measured and has been determined to be finite.
Therefore, the divergent term is apparently cancelled out by compensating infinities from other higher-order
diagrams. The main point of the renormalization procedure is that the bare quantity $g_e$ (or equivalently the
bare electron charge $e$) is not the physical quantity: higher-order corrections to the bare quantity give the
effective quantity $g_R$ (or equivalently $e_{\text{eff}}$), which is physically measurable. Therefore, we can work with the
expression for $\mathcal{M}$ obtained from the lowest-order diagram so long as we keep in mind that the masses and
coupling constants we use must be the renormalized ones (whose values can, in some cases, be determined
experimentally).

2. There is also a finite correction from the term $\propto g_R^2f(-q^2/(m^2c^2))$. This can also be absorbed into the
effective coupling constant, in which case we obtain a so-called running coupling constant, which means that it now
depends on energy and momentum (through $q$):

$$g_R(q^2) = g_R(0)\sqrt{1 + \frac{g_R(0)^2}{12\pi^2}f\left(\frac{-q^2}{m^2c^2}\right)}. \qquad (4.127)$$

High momentum $q$ is equivalent to a closer approach between the particles, so that the effective coupling
constant depends on the distance to other particles (which is physically reasonable due to vacuum polarization
and screening effects). For non-relativistic situations, this is nevertheless usually a small effect. The fact
that a coupling constant is running has crucial physical consequences as it leads to e.g. asymptotic freedom
in QCD, which we shall say more about in a little while.
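To get a feeling for the size of the running in Eq. (4.127), one can translate it into the fine-structure constant via $\alpha = g_R^2/4\pi$, which gives $\alpha(q^2) = \alpha(0)[1 + \frac{\alpha(0)}{3\pi}f(-q^2/m^2c^2)]$. The sketch below evaluates this with $f$ from Eq. (4.121). It includes only the electron loop discussed above, so the numbers are illustrative; the function names and the sample momentum transfer are assumptions.

```python
import math

ALPHA_0 = 1.0 / 137.035999     # alpha at zero momentum transfer
M_E_C2_GEV = 0.51099895e-3     # electron rest energy in GeV


def f(x, steps=20000):
    """Midpoint-rule estimate of f(x) in Eq. (4.121)."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        z = (k + 0.5) * h
        w = z * (1.0 - z)
        total += w * math.log1p(x * w)
    return 6.0 * h * total


def alpha_running(q_gev):
    """alpha(q^2) from Eq. (4.127), electron loop only.

    q_gev is sqrt(-q^2) c in GeV for a spacelike momentum transfer.
    """
    x = (q_gev / M_E_C2_GEV) ** 2
    return ALPHA_0 * (1.0 + ALPHA_0 / (3.0 * math.pi) * f(x))


inverse_alpha_low = 1.0 / alpha_running(0.0)
inverse_alpha_high = 1.0 / alpha_running(91.19)   # momentum transfer of order M_Z c
print(inverse_alpha_low, inverse_alpha_high)
```

Even over this enormous range of momentum transfer the coupling changes by only about two percent, consistent with the remark that the effect is small in most situations.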
V. WEAK INTERACTIONS AND ELECTROWEAK THEORY

Learning goals. After reading this chapter, the student should: 

• Be able to compute the Feynman amplitude M for weak interaction Feynman diagrams, and also be able to 
obtain decay rates and scattering cross sections from M . 

• Understand which particles interact via charged and neutral weak interactions and how these interactions are 
mediated. 

• Be able to describe how electromagnetic and weak interactions can be combined in a unified framework via 
chiral fermion states. 


We will here establish Feynman rules for charged ($W^\pm$) and neutral ($Z^0$) weak interactions related to leptons and
quarks, and treat some important processes in detail. Finally, we gather EM and weak vertices under the same
umbrella, namely Glashow-Weinberg-Salam electroweak theory, by using chiral fermion states.


A. Charged leptonic weak interactions 

The mediators of weak interactions are the charged $W^\pm$ bosons and the neutral $Z^0$ boson. Their masses are

$$M_W = 80.385 \pm 0.015\ \text{GeV}/c^2, \quad M_Z = 91.188 \pm 0.002\ \text{GeV}/c^2. \qquad (5.1)$$

Since these are massive spin-1 bosons, there are three available spin polarization states ($m_s = -1, 0, 1$). Their
propagator has the form

$$\frac{-i[g_{\mu\nu} - q_\mu q_\nu/(Mc)^2]}{q^2 - M^2c^2}. \qquad (5.2)$$

Since the boson masses are large, it is often experimentally the case that the momentum transfer satisfies
$q^2 \ll (Mc)^2$. In that case, we may simplify the propagator to $ig_{\mu\nu}/(Mc)^2$. The fundamental leptonic
vertex and the reverse process are shown below. Note that $W^+$ and $W^-$ are antiparticles of each other.



As for the Feynman rules, they are the same as in QED except for two things:

1. The propagator expression is modified since the mediator is massive.

2. Since the interaction is different from QED, the weak vertex factor reads

$$\frac{-ig_w}{2\sqrt{2}}\gamma^\mu(1 - \gamma^5). \qquad (5.3)$$

The extra factor $(1 - \gamma^5)$ causes parity violation. $\gamma^\mu$ on its own gives a vector coupling (as in QED), while
$\gamma^\mu\gamma^5$ gives an axial vector coupling, as described in the previous chapter.


Decay of the muon. 





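The amplitude is obtained as

$$\mathcal{M} = \frac{g_w^2}{8(M_Wc)^2}[\bar{u}(3)\gamma^\mu(1 - \gamma^5)u(1)][\bar{u}(4)\gamma_\mu(1 - \gamma^5)v(2)]. \qquad (5.4)$$

Using Casimir’s trick, we find that

$$\langle|\mathcal{M}|^2\rangle = 2\left(\frac{g_w}{M_Wc}\right)^4(p_1\cdot p_2)(p_3\cdot p_4). \qquad (5.5)$$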

Analyzing it in the muon rest frame, where $p_1 = (m_\mu c, \mathbf{0})$, we obtain $p_1\cdot p_2 = m_\mu E_2$. Moreover,
$p_1 = p_2 + p_3 + p_4$, from which it follows that

$$(p_3 + p_4)^2 = m_e^2c^2 + 2p_3\cdot p_4 = (p_1 - p_2)^2 = m_\mu^2c^2 - 2p_1\cdot p_2, \qquad (5.6)$$

which in turn leads to

$$p_3\cdot p_4 = \frac{(m_\mu^2 - m_e^2)c^2}{2} - m_\mu E_2. \qquad (5.7)$$

We have set the neutrino mass $m_\nu = 0$ and since $m_\mu \gg m_e$, we can set $m_e = 0$ as well as an approximation.

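Inserting our 4-vectors into the expression for $\langle|\mathcal{M}|^2\rangle$ gives:

$$\langle|\mathcal{M}|^2\rangle = \left(\frac{g_w}{M_Wc}\right)^4m_\mu^2E_2(m_\mu c^2 - 2E_2). \qquad (5.8)$$

We use the Golden rule to calculate the decay rate:

$$d\Gamma = \frac{\langle|\mathcal{M}|^2\rangle}{2\hbar m_\mu}\left(\prod_{i=2,3,4}\frac{c\,d^3\mathbf{p}_i}{(2\pi)^32E_i}\right)(2\pi)^4\delta^4(p_1 - p_2 - p_3 - p_4). \qquad (5.9)$$

Here, we have $E_i = |\mathbf{p}_i|c$. The $\mathbf{p}_3$ integral is performed:

$$d\Gamma = \frac{\langle|\mathcal{M}|^2\rangle c^3(d^3\mathbf{p}_2)(d^3\mathbf{p}_4)}{16(2\pi)^5\hbar m_\mu E_2E_3E_4}\,\delta(m_\mu c - E_2/c - E_3/c - E_4/c), \qquad (5.10)$$

where $E_3 = |\mathbf{p}_2 + \mathbf{p}_4|c$. We may then continue with the $\mathbf{p}_2$ integral. Let $\mathbf{p}_4 \parallel \hat{z}$ so that

$$(E_3/c)^2 = |\mathbf{p}_2 + \mathbf{p}_4|^2 = (E_2^2 + E_4^2 + 2E_2E_4\cos\theta)/c^2. \qquad (5.11)$$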

Moreover, $d^3\mathbf{p}_2 = (E_2/c)^2\,d(E_2/c)\,\sin\theta\,d\theta\,d\phi$ and since there is no $\phi$ dependence, $\int d\phi = 2\pi$.
To do the $\theta$-integration, let $x = E_3/c$ so that

$$dx = -\frac{E_2E_4\sin\theta\,d\theta}{cE_3}. \qquad (5.12)$$

We then have

$$\int_0^\pi\frac{\sin\theta\,d\theta}{E_3}\,\delta[m_\mu c - (E_2 + E_3 + E_4)/c] = \frac{c}{E_2E_4}\int_{x_-}^{x_+}\delta(m_\mu c - x - E_2/c - E_4/c)\,dx = \begin{cases}\dfrac{c}{E_2E_4} & \text{if } x_- < m_\mu c - E_2/c - E_4/c < x_+\\[2mm] 0 & \text{otherwise.}\end{cases} \qquad (5.13)$$
Here, we defined $x_\pm = |E_2 \pm E_4|/c$. We can write the above inequality in a physically more transparent way by
adding $(E_2 + E_4)/c$, multiplying by $c$, and dividing by 2:

$$\frac{1}{2}[|E_2 - E_4| + E_2 + E_4] < \frac{1}{2}m_\mu c^2 < E_2 + E_4. \qquad (5.14)$$

This is in fact equivalent to three inequalities: $E_2 < m_\mu c^2/2$, $E_4 < m_\mu c^2/2$, and $(E_2 + E_4) > m_\mu c^2/2$. If we
reflect upon these equations for a moment, we realize that they make sense physically. The maximum energy for
a particle is obtained if it emerges opposite to the two others. Conservation of momentum then dictates that the
particle picks up half of the available energy. Therefore, a pair of particles (e.g. 2 and 4) must always have at least
an energy of $m_\mu c^2/2$. If there is an angle between the two other particles, the third one will always acquire less energy.



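The inequalities dictate the limits for the $dE_2$ integral:

$$d\Gamma = \left(\frac{g_w}{4\pi M_Wc}\right)^4\frac{m_\mu c\,d^3\mathbf{p}_4}{\hbar E_4^2}\int_{m_\mu c^2/2 - E_4}^{m_\mu c^2/2}E_2(m_\mu c^2 - 2E_2)\,dE_2 = \left(\frac{g_w}{4\pi M_Wc}\right)^4\frac{m_\mu c\,d^3\mathbf{p}_4}{\hbar}\left(\frac{m_\mu c^2}{2} - \frac{2E_4}{3}\right). \qquad (5.15)$$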

Finally, setting $E = E_4$ and $d^3\mathbf{p}_4 = 4\pi(E_4/c)^2\,dE_4/c$ gives us:

$$\frac{d\Gamma}{dE} = \left(\frac{g_w}{M_Wc}\right)^4\frac{m_\mu^2E^2}{2\hbar(4\pi)^3}\left(1 - \frac{4E}{3m_\mu c^2}\right). \qquad (5.16)$$

This is the energy distribution of the electrons emitted in muon decay, and it agrees very well with experiment.
The total decay rate can be obtained as:

$$\Gamma = \int_0^{m_\mu c^2/2}\frac{d\Gamma}{dE}\,dE = \left(\frac{m_\mu g_w}{M_W}\right)^4\frac{m_\mu c^2}{12\hbar(8\pi)^3}, \qquad (5.17)$$

which in turn gives the lifetime $\tau = \Gamma^{-1}$. Comparing with the experimentally determined value of $\tau$, one obtains
$g_w = 0.66$. This gives a weak fine structure constant

$$\alpha_w = \frac{g_w^2}{4\pi} = \frac{1}{29}, \qquad (5.18)$$

which is roughly five times $\alpha_{\text{QED}}$. The intrinsic coupling in weak interactions is then large, but the interactions
still remain feeble at low energies because the propagator is so massive [$\sim (q^2 - M_W^2c^2)^{-1}$]. For high energies
(momentum transfer), $q^2 \sim M_W^2c^2$, the weak interactions can dominate over the electromagnetic ones.
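The step from the electron spectrum (5.16) to the total rate and to $g_w$ can be reproduced numerically. The sketch below integrates the spectrum with a midpoint rule, compares with the closed form $\Gamma = (m_\mu g_w/M_W)^4 m_\mu c^2/[12\hbar(8\pi)^3]$ that follows from it, and then solves for $g_w$ from the muon lifetime. The numerical constants (in particular the measured lifetime used as input) and the function names are assumptions made for illustration; all masses are given as rest energies in MeV.

```python
import math

HBAR_MEV_S = 6.582119569e-22   # hbar in MeV*s
M_MU = 105.6583755             # muon rest energy in MeV
M_W = 80385.0                  # W rest energy in MeV, from Eq. (5.1)
TAU_MU = 2.1969811e-6          # measured muon lifetime in s (assumed input)


def spectrum(e, g_w):
    """dGamma/dE of Eq. (5.16) in 1/(s*MeV); e is the electron energy in MeV."""
    prefactor = (g_w / M_W) ** 4 * M_MU ** 2 / (2.0 * HBAR_MEV_S * (4.0 * math.pi) ** 3)
    return prefactor * e ** 2 * (1.0 - 4.0 * e / (3.0 * M_MU))


def total_rate_numeric(g_w, steps=10000):
    """Integrate the spectrum from E = 0 to E = m_mu c^2 / 2 with the midpoint rule."""
    h = (M_MU / 2.0) / steps
    return h * sum(spectrum((k + 0.5) * h, g_w) for k in range(steps))


def total_rate_closed(g_w):
    """Closed-form total decay rate in 1/s."""
    return (M_MU * g_w / M_W) ** 4 * M_MU / (12.0 * HBAR_MEV_S * (8.0 * math.pi) ** 3)


# The rate scales as g_w^4, so g_w follows from the measured lifetime.
g_w = (1.0 / (TAU_MU * total_rate_closed(1.0))) ** 0.25
alpha_w = g_w ** 2 / (4.0 * math.pi)
print(g_w, 1.0 / alpha_w, total_rate_numeric(g_w) / total_rate_closed(g_w))
```

With these inputs the fitted coupling comes out close to the value quoted in the text, and $1/\alpha_w$ lands near 29.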


Decay of the neutron. 




Neutrons and protons are composite particles since they consist of quarks, but perhaps we can expect that treating
them as point Dirac particles will give a reasonable approximation? The calculation proceeds in essentially the
same way as for $\mu$-decay, and the final result is:

$$\Gamma = \frac{1}{4\pi^3\hbar}\left(\frac{g_w}{2M_Wc^2}\right)^4(m_ec^2)^5\left[\frac{1}{15}(2a^4 - 9a^2 - 8)\sqrt{a^2 - 1} + a\ln(a + \sqrt{a^2 - 1})\right], \qquad (5.19)$$

where $a = (m_n - m_p)/m_e$. Note that we cannot here neglect the electron mass, since the rest energy of
the electron is comparable to the released energy of the reaction, $(m_n - m_p - m_e)c^2$, due to the small mass
difference between the neutron and the proton. This is not the case for $\mu$-decay, where the released energy is
$(m_\mu - m_e)c^2$. Putting in the numbers, we obtain $\tau = 1/\Gamma = 1316$ s. The experimental value, on the other hand,
is $\tau = 898 \pm 16$ s. The order of magnitude is thus correct, but there is still a deviation. Considering that weak
decay processes range from 15 minutes (the neutron) to $10^{-13}$ s, our theoretical estimate is not that bad. But what
is the reason for the discrepancy? We did assume that $p$ and $n$ are point particles (neglecting their internal quark
structure) and assumed an interaction with $W$ in the same way as leptons interact with it. At the same time, we
also know that the Mott formula works very well for $e-p$ scattering mediated by $\gamma$, where $p$ is treated as a point
particle. The crucial question becomes: what is the net coupling strength of the proton and the neutron to $W^\pm$?

In electrodynamics, all internal complications do not matter because electric charge is conserved. However, we
do not know that the same is necessarily true for weak interactions. For instance, a gluon splitting to a $q\bar{q}$ pair
might make a finite contribution to the effective weak coupling vertex since quarks interact weakly. To account
for this in the $n \to p + W$ vertex, we make the substitution $(1 - \gamma^5) \to (c_V - c_A\gamma^5)$, where $c_V$ is the correction
to the vector "weak charge" while $c_A$ is the correction to the axial vector "weak charge". Experimentally, one
finds $c_V \simeq 1.00$ and $c_A \simeq 1.26$. The corrected theoretical estimate for the lifetime then comes into much closer
agreement with the experimental one: $\tau = 914$ s.
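The point-particle estimate of the neutron lifetime can be reproduced by putting numbers into Eq. (5.19). The sketch below does this with rest energies in MeV. The particle masses, the value $g_w = 0.653$ (assumed here, close to what a fit to the muon lifetime gives), and the function name are inputs chosen for illustration; the $c_V$, $c_A$ correction discussed above is not included.

```python
import math

HBAR_MEV_S = 6.582119569e-22   # hbar in MeV*s
M_W = 80385.0                  # W rest energy in MeV, from Eq. (5.1)
M_E = 0.51099895               # electron rest energy in MeV
M_N = 939.56542                # neutron rest energy in MeV
M_P = 938.27209                # proton rest energy in MeV


def neutron_rate(g_w):
    """Decay rate of Eq. (5.19) in 1/s, treating n and p as point Dirac particles."""
    a = (M_N - M_P) / M_E
    root = math.sqrt(a * a - 1.0)
    bracket = (2.0 * a ** 4 - 9.0 * a ** 2 - 8.0) * root / 15.0 + a * math.log(a + root)
    return (g_w / (2.0 * M_W)) ** 4 * M_E ** 5 * bracket / (4.0 * math.pi ** 3 * HBAR_MEV_S)


tau_point_particle = 1.0 / neutron_rate(0.653)   # g_w = 0.653 is an assumed input
print(f"{tau_point_particle:.0f} s")
```

Since the rate scales as $g_w^4$, the result is quite sensitive to the coupling used; with the value above it comes out at roughly 1300 s, in line with the uncorrected estimate quoted in the text.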
Decay of the pion.

The process $\pi^- \to l^- + \bar{\nu}_l$ is actually a scattering event of bound quarks [see Fig. (a)], and can in a sense be
viewed analogously to positronium decay ($e^+ + e^- \to \gamma + \gamma$).



Analyzing this decay in the framework of weak interactions, we may represent the scattering as in (b). The dark
blob represents the (unknown) coupling of $\pi^-$ to $W^-$. Let $F^\mu$ describe the blob. The Feynman amplitude takes
the form:

$$\mathcal{M} = \bar{u}(3)\left(\frac{-ig_w}{2\sqrt{2}}\gamma^\mu(1 - \gamma^5)\right)v(2)\,\frac{ig_{\mu\nu}}{(M_Wc)^2}\,F^\nu = \frac{g_w^2}{8(M_Wc)^2}[\bar{u}(3)\gamma_\mu(1 - \gamma^5)v(2)]F^\mu. \qquad (5.20)$$

We here absorbed a constant into $F^\mu$. We know it must be a 4-vector in order to contract with $\gamma_\mu$, so that
$\mathcal{M}$ ends up being a scalar. Since $\pi^-$ has spin 0, $F^\mu$ can only depend on $p^\mu$. Therefore, $F^\mu = f_\pi p^\mu$ where
$f_\pi$ is a scalar. We now perform a summation over outgoing spins:




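$$\langle|\mathcal{M}|^2\rangle = \left[\frac{f_\pi}{8}\left(\frac{g_w}{M_Wc}\right)^2\right]^2p_\mu p_\nu\text{Tr}\{\gamma^\mu(1 - \gamma^5)\slashed{p}_2\gamma^\nu(1 - \gamma^5)(\slashed{p}_3 + m_lc)\} = \frac{1}{8}\left[f_\pi\left(\frac{g_w}{M_Wc}\right)^2\right]^2[2(p\cdot p_2)(p\cdot p_3) - p^2(p_2\cdot p_3)]. \qquad (5.21)$$

Since $p = p_2 + p_3$, we have $2p_2\cdot p_3 = (m_\pi^2 - m_l^2)c^2$. As a result, we obtain
$\langle|\mathcal{M}|^2\rangle = (g_w/2M_W)^4f_\pi^2m_l^2(m_\pi^2 - m_l^2)$ and, via the formula for two-body decays, the decay rate

$$\Gamma = \frac{f_\pi^2}{\pi\hbar m_\pi^3}\left(\frac{g_w}{4M_W}\right)^4m_l^2(m_\pi^2 - m_l^2)^2. \qquad (5.22)$$

This is as far as we get without specifying what $f_\pi$ is. However, we can calculate branching ratios:

$$\frac{\Gamma(\pi^- \to e^- + \bar{\nu}_e)}{\Gamma(\pi^- \to \mu^- + \bar{\nu}_\mu)} = \frac{m_e^2(m_\pi^2 - m_e^2)^2}{m_\mu^2(m_\pi^2 - m_\mu^2)^2} = 1.28\times10^{-4}. \qquad (5.23)$$

Experimental measurements give $(1.23 \pm 0.02)\times10^{-4}$, so it is a good fit.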
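The branching ratio in Eq. (5.23) depends only on the three masses, since the unknown $f_\pi$ and the coupling cancel. A short sketch of the arithmetic follows; the mass values (rest energies in MeV) and the function name are assumptions made for illustration.

```python
M_E = 0.51099895      # electron rest energy in MeV
M_MU = 105.6583755    # muon rest energy in MeV
M_PI = 139.57039      # charged pion rest energy in MeV


def rate_shape(m_lepton):
    """Lepton-mass dependence m_l^2 (m_pi^2 - m_l^2)^2 of the pion decay rate.

    Common factors (f_pi, g_w, M_W, hbar) are omitted because they cancel in ratios.
    """
    return m_lepton ** 2 * (M_PI ** 2 - m_lepton ** 2) ** 2


ratio = rate_shape(M_E) / rate_shape(M_MU)
print(f"{ratio:.3e}")
```

The factor $m_l^2$ is what suppresses the electron channel: the phase-space factor $(m_\pi^2 - m_l^2)^2$ alone would favour the electron by a factor of about 5.5.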


However, let us consider whether this result is physically reasonable. It appears that $\pi^-$ prefers to decay into $\mu$
in spite of the fact that $m_\mu \gg m_e$. This seems to contradict the general rule that decays occur faster the larger the
mass difference, since there is more phase space available for decay into a lighter particle. There exists, however,
an explanation. Consider the hypothetical case where $m_e = 0$. If so, then $\pi^- \to e^- + \bar{\nu}_e$ would be completely
forbidden for the following reason: $\pi^-$ has $s = 0$, so that $e^-$ and $\bar{\nu}_e$ have to emerge with opposite spins. This
means that they have equal helicity.

[Figure: the $e^-$ and $\bar{\nu}_e$ emerging back to back from the $\pi^-$ decay.]
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Since v e is considered to be right-handed in all cases treated here (although nowadays it is known that neutrinos 
have a small, but finite mass, meaning that left-handed antineutrinos should exist as well), e should also be that 
here. However, if m e = 0, then (1 —7 s ) in the weak vertex would only couple to left-handed electrons (just as 
it only couples to left-handed neutrinos). Therefore, the decay to e is strongly suppressed since m e is so small 
compared to the other energy scales. 


B. Charged weak interactions of quarks 

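For leptons, a coupling to $W^\pm$ occurs only within a particular generation:

$$\begin{pmatrix}\nu_e\\ e\end{pmatrix},\quad \begin{pmatrix}\nu_\mu\\ \mu\end{pmatrix},\quad \begin{pmatrix}\nu_\tau\\ \tau\end{pmatrix}. \tag{5.24}$$

No cross-terms of the type $e^- \to \nu_\mu + W^-$ occur. For quarks, the generation structure is similar:

$$\begin{pmatrix}u\\ d\end{pmatrix},\quad \begin{pmatrix}c\\ s\end{pmatrix},\quad \begin{pmatrix}t\\ b\end{pmatrix}, \tag{5.25}$$

but now cross-generational coupling is allowed. For instance, $s \to u + W^-$ underlies the decay $\Lambda \to p + e^- + \bar\nu_e$.
In 1963, when only the $u, d, s$ quarks were known, Cabibbo suggested that the $d \to u + W^-$ vertex carries an extra
factor $\cos\theta_C$, whereas $s \to u + W^-$ carries $\sin\theta_C$.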
Experimentally, one finds $\theta_C = 13.1°$, so that weak interactions almost respect quark generations. Various decays
may be classified as follows:

• Leptonic decays ($K^- \to l^- + \bar\nu_l$)

• Semileptonic decays ($\pi^- \to \pi^0 + e^- + \bar\nu_e$)

• Non-leptonic decays ($K^- \to \pi^0 + \pi^-$).

Cabibbo's idea was very successful for many decay rates, but a problem arose when considering $K^0 \to \mu^+ + \mu^-$.

The amplitude should be proportional to $\sin\theta_C\cos\theta_C$, since the diagram has the following form:




However, this did not agree with the experimentally measured rate, which was much lower than the prediction.
The paradox was resolved in 1970 by Glashow, Iliopoulos, and Maiani, who introduced a fourth quark $c$ (even prior
to its experimental discovery). The new quark would interact in the following manner:

Now, the diagram for $K^0 \to \mu^+ + \mu^-$ was almost completely cancelled by the corresponding one where $u$ was
replaced with $c$, since that one was proportional to $(-\sin\theta_C\cos\theta_C)$. The cancellation was not perfect due to the
mass mismatch $m_c \neq m_u$, leaving behind a small net amplitude.

The Cabibbo-GIM scheme then suggests the following: in weak interactions, one should replace the physical
quarks $d$ and $s$ with $d'$ and $s'$:

$$d' = d\cos\theta_C + s\sin\theta_C, \qquad s' = -d\sin\theta_C + s\cos\theta_C. \tag{5.26}$$
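To get a feeling for the numbers, the following sketch evaluates the Cabibbo rotation for the quoted angle $\theta_C = 13.1°$. The function names are our own, and the "amplitudes" are treated as plain real numbers purely for illustration.

```python
import math

THETA_C = math.radians(13.1)  # Cabibbo angle quoted in the text


def cabibbo_rotate(d, s, theta=THETA_C):
    """Return (d', s') for given (d, s) components, following Eq. (5.26)."""
    c, sn = math.cos(theta), math.sin(theta)
    return d * c + s * sn, -d * sn + s * c


def relative_strengths(theta=THETA_C):
    """Squared vertex factors for d -> u + W and s -> u + W."""
    return math.cos(theta) ** 2, math.sin(theta) ** 2


def gim_sum(theta=THETA_C):
    """Sum of the u- and c-exchange coupling factors for K0 -> mu+ mu-."""
    u_term = math.sin(theta) * math.cos(theta)
    c_term = -math.sin(theta) * math.cos(theta)
    return u_term + c_term


if __name__ == "__main__":
    cos2, sin2 = relative_strengths()
    print(f"cos^2 = {cos2:.3f}, sin^2 = {sin2:.3f}, GIM sum = {gim_sum():.1e}")
```

The rotation preserves the norm of $(d, s)$, and the coupling factors of the $u$ and $c$ diagrams cancel exactly; only the mass difference between $u$ and $c$ leaves a residual amplitude.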

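The $W^\pm$'s then couple to the Cabibbo-rotated states

$$\begin{pmatrix}u\\ d'\end{pmatrix} = \begin{pmatrix}u\\ d\cos\theta_C + s\sin\theta_C\end{pmatrix},\qquad \begin{pmatrix}c\\ s'\end{pmatrix} = \begin{pmatrix}c\\ -d\sin\theta_C + s\cos\theta_C\end{pmatrix}. \tag{5.27}$$

In this way, $d \to u + W^-$ carries a factor $\cos\theta_C$, $s \to u + W^-$ carries $\sin\theta_C$, and so forth.
Kobayashi and Maskawa (KM) generalized this to three "weak interaction generations" of quarks: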


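$$\begin{pmatrix}d'\\ s'\\ b'\end{pmatrix} = \begin{pmatrix}U_{ud} & U_{us} & U_{ub}\\ U_{cd} & U_{cs} & U_{cb}\\ U_{td} & U_{ts} & U_{tb}\end{pmatrix}\begin{pmatrix}d\\ s\\ b\end{pmatrix} \tag{5.28}$$

where $U_{ud}$ represents the coupling of $u$ to $d$ ($d \to u + W^-$). Not all entries are independent, because $U$ must be a
unitary matrix.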


C. Neutral weak interactions 

The fundamental vertex for these interactions is shown in (a).

[Figure (a): the neutral weak vertex, where a fermion $f$ couples to $Z^0$.]

Here, $f$ is any lepton or quark. There is no fermion-mixing, though. The first experimental indication of neutral
weak interactions was provided in 1973 through the process $\bar\nu_\mu + e \to \bar\nu_\mu + e$ shown in (b). We know that the
coupling of quarks and leptons to $W^\pm$ is $-(i g_w/2\sqrt{2})\gamma^\mu(1 - \gamma^5)$. This is slightly modified when coupling to
composite particles, like the proton, but that is due to a contamination from the strong interaction. The coupling to $Z^0$
is similarly given by the vertex factor

$$-\frac{i g_z}{2}\gamma^\mu(c_V^f - c_A^f\gamma^5). \tag{5.29}$$

Here, $g_z$ is the neutral coupling constant while $c^f_{V,A}$ are coefficients depending on the fermion type $f$. All of these
numbers are determined by the weak mixing angle, also known as the Weinberg angle, $\theta_w$. These relations are
summarized as follows.


$f$                              $c_V$                                    $c_A$
$\nu_e, \nu_\mu, \nu_\tau$       $\frac{1}{2}$                            $\frac{1}{2}$
$e, \mu, \tau$                   $-\frac{1}{2} + 2\sin^2\theta_w$         $-\frac{1}{2}$
$u, c, t$                        $\frac{1}{2} - \frac{4}{3}\sin^2\theta_w$    $\frac{1}{2}$
$d, s, b$                        $-\frac{1}{2} + \frac{2}{3}\sin^2\theta_w$   $-\frac{1}{2}$

This will be motivated later on, as we unify QED and weak interactions into electroweak theory. There is no way
to compute $\theta_w$ theoretically in the Standard Model, but its value can be inferred from experiments: $\theta_w = 28.7°$.
Finally, the propagator is:

$$\frac{-i\left(g_{\mu\nu} - q_\mu q_\nu/(M_Z c)^2\right)}{q^2 - M_Z^2 c^2} \tag{5.30}$$

with $M_W = M_Z\cos\theta_w$.
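The table of neutral couplings is easy to evaluate numerically. The sketch below simply encodes the four rows for the quoted value $\theta_w = 28.7°$; the dictionary keys and the function name are our own labels.

```python
import math


def neutral_couplings(theta_w_deg=28.7):
    """Return {fermion type: (c_V, c_A)} from the table of neutral couplings."""
    s2 = math.sin(math.radians(theta_w_deg)) ** 2  # sin^2(theta_w)
    return {
        "neutrino": (0.5, 0.5),
        "charged lepton": (-0.5 + 2.0 * s2, -0.5),
        "up-type quark": (0.5 - 4.0 * s2 / 3.0, 0.5),
        "down-type quark": (-0.5 + 2.0 * s2 / 3.0, -0.5),
    }


if __name__ == "__main__":
    for fermion, (c_v, c_a) in neutral_couplings().items():
        print(f"{fermion:16s} c_V = {c_v:+.4f}  c_A = {c_a:+.4f}")
```

Note how small $c_V$ becomes for the charged leptons: since $\sin^2\theta_w$ is close to $1/4$, the vector coupling of the electron to $Z^0$ nearly vanishes.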


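Example 14. Elastic $\nu_e - e$ scattering.

[Figure: Feynman diagram for elastic neutrino-electron scattering via $Z^0$ exchange.]

The amplitude is obtained using the Feynman rules:

$$\mathcal{M} = \frac{g_z^2}{8(M_Z c)^2}\left[\bar u(3)\gamma^\mu(1-\gamma^5)u(1)\right]\left[\bar u(4)\gamma_\mu(c_V - c_A\gamma^5)u(2)\right]. \tag{5.31}$$

As before, $\{c_V, c_A\}$ are the neutral weak couplings for electrons. If we go to the CM frame and assume very high
energy scattering so that we may neglect the electron mass (its rest energy), we obtain:

$$\langle|\mathcal{M}|^2\rangle = 2\left(\frac{g_z E}{M_Z c^2}\right)^4\left[(c_V + c_A)^2 + (c_V - c_A)^2\cos^4(\theta/2)\right] \tag{5.32}$$

where $E$ is the electron energy (or equivalently the neutrino energy, since we have neglected $m_e$ and operate in the
CM frame) and $\theta$ is the scattering angle.

[Figure: the collision in the CM frame before and after scattering, with the incoming $\nu_e$ (momentum $p_1$) and electron (momentum $p_2$) approaching each other.]

Example 15.

The differential scattering cross section for this situation can be worked out according to our previous treatment of
how to relate $d\sigma/d\Omega$ to $\mathcal{M}$ for two-particle scattering in the CM frame:

$$\frac{d\sigma}{d\Omega} = 2\left(\frac{\hbar c}{\pi}\right)^2\left(\frac{g_z}{4M_Z c^2}\right)^4 E^2\left[(c_V + c_A)^2 + (c_V - c_A)^2\cos^4(\theta/2)\right] \tag{5.33}$$

so that the total $\sigma$ becomes:

$$\sigma = \frac{2}{3\pi}(\hbar c)^2\left(\frac{g_z}{2M_Z c^2}\right)^4 E^2\left(c_V^2 + c_A^2 + c_V c_A\right). \tag{5.34}$$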
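The step from the differential to the total cross section is just an angular integration, which can be verified numerically. The sketch below integrates the angular factor of Eq. (5.33) over the full solid angle with Simpson's rule and compares with the closed form $(16\pi/3)(c_V^2 + c_A^2 + c_V c_A)$; the function names and the sample couplings are our own choices.

```python
import math


def angular_factor(theta, c_v, c_a):
    """Angular dependence appearing in Eq. (5.33)."""
    return (c_v + c_a) ** 2 + (c_v - c_a) ** 2 * math.cos(theta / 2) ** 4


def integrate_solid_angle(c_v, c_a, n=2000):
    """Simpson's rule for 2*pi * int_0^pi f(theta) sin(theta) dtheta (n even)."""
    h = math.pi / n
    total = 0.0
    for k in range(n + 1):
        theta = k * h
        weight = 1 if k in (0, n) else (4 if k % 2 else 2)
        total += weight * angular_factor(theta, c_v, c_a) * math.sin(theta)
    return 2 * math.pi * total * h / 3


def closed_form(c_v, c_a):
    """Analytic result of the same integral."""
    return (16 * math.pi / 3) * (c_v**2 + c_a**2 + c_v * c_a)


if __name__ == "__main__":
    c_v, c_a = -0.0388, -0.5  # approximate electron couplings for 28.7 degrees
    print(integrate_solid_angle(c_v, c_a), closed_form(c_v, c_a))
```

Multiplying the closed form by the prefactor $2(\hbar c/\pi)^2 (g_z/4M_Z c^2)^4 E^2$ of Eq. (5.33) reproduces the prefactor $(2/3\pi)(\hbar c)^2 (g_z/2M_Z c^2)^4 E^2$ of Eq. (5.34).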



It is important to note that most (but not all) neutral processes are masked by competing electromagnetic ones. For
instance, $e^- + e^+ \to \mu^- + \mu^+$ can occur via exchange of either a virtual $Z^0$ or a virtual $\gamma$. Conversely, there is a weak
interaction contamination in every electromagnetic process since $Z^0$ couples to everything that $\gamma$ couples to (and
more). Even if the effect is small, its smoking gun signature, when it is observable, is parity violation. To access
weak interactions alone, one has to use neutrinos which do not have any electromagnetic coupling. Alternatively, if
one is able to access energies so high that $q \sim M_Z c$, the denominator of the $Z^0$ propagator becomes very small and
leads to a large interaction that dominates even the $\gamma$ contribution. Let us consider an example of such a situation.


Example 16. $e^- e^+$ scattering near the $Z^0$ pole. We are considering the process $e^- + e^+ \to f + \bar f$ where $f$ is a
quark or a lepton.

Assume that $m_f \ll M_Z$, but let us keep the exact form of the $Z^0$ propagator since we are interested in large
momentum transfers $q \sim M_Z c$. By the way, note that $Z^0$ is its own antiparticle. We obtain:

$$\mathcal{M} = -\frac{g_z^2}{4[q^2 - (M_Z c)^2]}\left[\bar u(4)\gamma^\mu(c_V^f - c_A^f\gamma^5)v(3)\right]\left[g_{\mu\nu} - \frac{q_\mu q_\nu}{(M_Z c)^2}\right]\left[\bar v(2)\gamma^\nu(c_V^e - c_A^e\gamma^5)u(1)\right] \tag{5.35}$$

with $q = p_1 + p_2 = p_3 + p_4$. Since we will consider energies near the $Z^0$ mass of 90 GeV, we may safely neglect
all fermion masses. From the second term, we find that $q_\mu$ contracts with $\gamma^\mu$ to yield

$$\bar u(4)\not{q}(c_V - c_A\gamma^5)v(3). \tag{5.36}$$
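Moreover, since $\not{q} = \not{p}_3 + \not{p}_4$ and $\bar u(4)\not{p}_4 = 0$ (this is the Dirac equation for $m = 0$) in addition to:

$$\not{p}_3(c_V - c_A\gamma^5)v(3) = (c_V + c_A\gamma^5)\not{p}_3 v(3) = 0, \tag{5.37}$$

we see that the term dependent on $q_\mu q_\nu$ gives no contribution to $\mathcal{M}$. Performing now the spin summation (Casimir's
trick):

$$\langle|\mathcal{M}|^2\rangle = \left[\frac{g_z^2}{8(q^2 - M_Z^2 c^2)}\right]^2 \mathrm{Tr}\left\{\gamma^\mu(c_V^f - c_A^f\gamma^5)\not{p}_3\gamma^\nu(c_V^f - c_A^f\gamma^5)\not{p}_4\right\} \times \mathrm{Tr}\left\{\gamma_\mu(c_V^e - c_A^e\gamma^5)\not{p}_1\gamma_\nu(c_V^e - c_A^e\gamma^5)\not{p}_2\right\}. \tag{5.38}$$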


Performing the traces (using rules for $\gamma$-matrices) and integrating over the scattering angle, we find:

$$\sigma = \frac{1}{3\pi}\left(\frac{\hbar c\, g_z^2 E}{4[(2E)^2 - (M_Z c^2)^2]}\right)^2\left[(c_V^f)^2 + (c_A^f)^2\right]\left[(c_V^e)^2 + (c_A^e)^2\right] \tag{5.39}$$

in the CM frame. We now see that as the total energy $2E \to M_Z c^2$, the cross section $\sigma$ diverges! To counter this
(a true mathematical divergence can never be observed physically), take into account the finite lifetime $\tau_Z$ of $Z^0$
which modifies the propagator to:

$$\frac{1}{q^2 - (M_Z c)^2} \to \frac{1}{q^2 - (M_Z c)^2 + i\hbar M_Z\Gamma_Z} \tag{5.40}$$

where $\Gamma_Z = \tau_Z^{-1}$. To derive why the finite lifetime leads to the imaginary term in the denominator is beyond the
scope of this course, but as a sidenote it is interesting to be aware of the fact that the same thing happens when
describing finite lifetimes of quasiparticles in condensed matter systems. In any case, the effect of the term
$\propto \Gamma_Z$ is that it "smears" out the effective mass of the propagator so that the divergence right at the bare mass $M_Z$
is avoided and one obtains

$$\sigma = \frac{(\hbar c\, g_z^2 E)^2}{48\pi}\,\frac{\left[(c_V^f)^2 + (c_A^f)^2\right]\left[(c_V^e)^2 + (c_A^e)^2\right]}{[(2E)^2 - (M_Z c^2)^2]^2 + (\hbar\Gamma_Z M_Z c^2)^2}. \tag{5.41}$$

The correction from the finite lifetime is negligible except if $2E \simeq M_Z c^2$, in which case it is crucial since it
prevents a divergence. Now, the same process mediated by a photon gives:


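$$\sigma = \frac{(\hbar c\, g_e^2)^2 (Q^f)^2}{48\pi E^2} \tag{5.42}$$

where $Q^f$ is the charge of $f$ in units of $e$. We can actually compare $\gamma$- and $Z^0$-mediated scattering directly via the
ratio:

$$\frac{\sigma(e^+e^- \to Z^0 \to \mu^+\mu^-)}{\sigma(e^+e^- \to \gamma \to \mu^+\mu^-)} \approx \frac{2E^4}{[(2E)^2 - (M_Z c^2)^2]^2 + (\hbar\Gamma_Z M_Z c^2)^2} \tag{5.43}$$

when inserting for $\theta_w$ in $c^e_{V,A}$ and $c^\mu_{V,A}$. Two particular limiting cases are of interest:

$$\lim_{2E \ll M_Z c^2}\frac{\sigma_Z}{\sigma_\gamma} \approx 2\left(\frac{E}{M_Z c^2}\right)^4 \ll 1, \tag{5.44}$$

whereas near the $Z^0$ pole

$$\lim_{2E \to M_Z c^2}\frac{\sigma_Z}{\sigma_\gamma} \approx \frac{1}{8}\left(\frac{M_Z c^2}{\hbar\Gamma_Z}\right)^2 \approx 160. \tag{5.45}$$

We have used the value $\hbar\Gamma_Z = 2.5$ GeV. Hence, the weak mechanism is strongly favored near the $Z^0$ pole.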
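The two limits are easy to explore numerically. The sketch below evaluates the ratio of Eq. (5.43) with all energies in GeV, using $M_Z c^2 = 90$ GeV and $\hbar\Gamma_Z = 2.5$ GeV as quoted in the text, and also computes the coupling-dependent prefactor $[\frac{1}{2} - 2\sin^2\theta_w + 4\sin^4\theta_w]^2/(\sin\theta_w\cos\theta_w)^4$ that Eq. (5.43) rounds to 2. That prefactor follows from the table of $c_V$, $c_A$ together with $g_z = g_e/(\sin\theta_w\cos\theta_w)$; the function names are our own.

```python
import math


def ratio_z_to_gamma(beam_energy, mz=90.0, gamma_z=2.5):
    """sigma_Z / sigma_gamma from Eq. (5.43); all arguments in GeV."""
    detuning = (2 * beam_energy) ** 2 - mz**2
    return 2 * beam_energy**4 / (detuning**2 + (gamma_z * mz) ** 2)


def coupling_prefactor(theta_w_deg=28.7):
    """Exact coupling factor that is approximated by 2 in Eq. (5.43)."""
    s2 = math.sin(math.radians(theta_w_deg)) ** 2
    cv2_plus_ca2 = (-0.5 + 2 * s2) ** 2 + 0.25  # electron and muon are alike
    return cv2_plus_ca2**2 / (s2 * (1 - s2)) ** 2


if __name__ == "__main__":
    for energy in (1.0, 10.0, 40.0, 45.0, 50.0):
        print(f"E = {energy:5.1f} GeV  ratio = {ratio_z_to_gamma(energy):.3e}")
    print(f"coupling prefactor = {coupling_prefactor():.3f}")
```

Scanning the beam energy through $E = M_Z c^2/2$ traces out the resonance peak, whose width is set by $\hbar\Gamma_Z$.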
D. Electroweak unification


Chiral fermion states.

We would now like to explore where the GWS-parameters, such as $c_V$ and $c_A$, and their dependence on the
Weinberg angle $\theta_w$ come from. First, note that Glashow's original aim was to unify weak and electromagnetic
interactions as manifestations of one fundamental "electroweak" interaction. An immediate problem arises: if it is
indeed the same underlying interaction, why is $\gamma$ massless while $W^\pm$ and $Z^0$ are so heavy? The solution turns out to
be the Higgs mechanism, but we shall delay a detailed treatment of this to a later chapter in this book.


There is another issue which also must be resolved in the quest to merge weak and electromagnetic interactions:
the structural difference between the vertices, namely $\gamma^\mu$ versus $\gamma^\mu(1 - \gamma^5)$. One way to fix this is to absorb
$(1 - \gamma^5)$ into the particle spinor itself:

$$u_L(p) = \frac{1 - \gamma^5}{2}u(p), \tag{5.46}$$

so that the vertex structure becomes the same in QED and weak interactions. In general, $u_L$ is not an eigenstate of
the helicity operator in spite of "L" representing "left-handed", i.e. helicity $-1$. We have:


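$$\gamma^5 u(p) = \begin{pmatrix}\dfrac{c(\mathbf{p}\cdot\boldsymbol{\sigma})}{E + mc^2} & 0\\ 0 & \dfrac{c(\mathbf{p}\cdot\boldsymbol{\sigma})}{E - mc^2}\end{pmatrix}u(p) \tag{5.47}$$

which is clearly seen to not be proportional to $u(p)$. We can prove the above equation as follows. First, note that

$$\gamma^5 u = \begin{pmatrix}0 & 1\\ 1 & 0\end{pmatrix}\begin{pmatrix}u_A\\ u_B\end{pmatrix} = \begin{pmatrix}u_B\\ u_A\end{pmatrix}. \tag{5.48}$$

Now, use that $(\not{p} - mc)u = 0$ gives us two equations (the upper and lower components):

$$u_A = \frac{c}{E - mc^2}(\mathbf{p}\cdot\boldsymbol{\sigma})u_B, \qquad u_B = \frac{c}{E + mc^2}(\mathbf{p}\cdot\boldsymbol{\sigma})u_A. \tag{5.49}$$

Inserted into Eq. (5.48), we then obtain the desired result Eq. (5.47). Now, if $m = 0$, then we obtain

$$\gamma^5 u(p) = (\hat{\mathbf{p}}\cdot\boldsymbol{\Sigma})u(p) \tag{5.50}$$

where $\hat{\mathbf{p}}\cdot\boldsymbol{\Sigma}$ is the helicity operator with eigenvalues $\pm 1$. Thus, for mass $m = 0$ one has:

$$\frac{1}{2}(1 - \gamma^5)u(p) = \begin{cases}0 & \text{if } u(p) \text{ has helicity } +1\\ u(p) & \text{if } u(p) \text{ has helicity } -1\end{cases} \tag{5.51}$$

It is important to emphasize that this holds exactly only for $m = 0$, but $u_L$ is nevertheless always referred to as a
left-handed particle. We can think of $\frac{1}{2}(1 - \gamma^5)$ as a projection operator that picks out the $-1$ helicity component
from $u(p)$ for $m = 0$.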


For an antiparticle:

$$v_L(p) = \frac{1 + \gamma^5}{2}v(p). \tag{5.52}$$

For a right-handed counterpart of a particle or antiparticle, let $\gamma^5 \to (-\gamma^5)$. The states $u_{L,R}$ and $v_{L,R}$ are chiral
fermion states, where the word "chiral" derives from the Greek word for "hand" (in reference to their left/right nature). In this way,
weak and electromagnetic interactions can be expressed in a more unified form. Consider for instance the process:

[Figure: the vertex $e^- \to \nu_e + W^-$.]

which contributes with a factor

$$j_\mu^- = \bar\nu\gamma_\mu\left(\frac{1 - \gamma^5}{2}\right)e \tag{5.53}$$

to the amplitude $\mathcal{M}$. Here, $\nu$ and $e$ represent the spinors. The quantity $j_\mu^-$ is the weak current, and can be viewed
as an analogue to the electric current in QED. Now, the anticommutation relation $\{\gamma^\mu, \gamma^5\} = 0$ gives that
$\gamma^\mu(1 - \gamma^5)/2 = (1 + \gamma^5)\gamma^\mu/2$. Also, we have $[(1 - \gamma^5)/2]^2 = (1 - \gamma^5)/2$. Therefore, we obtain the relation

$$\gamma_\mu\left(\frac{1 - \gamma^5}{2}\right) = \left(\frac{1 + \gamma^5}{2}\right)\gamma_\mu\left(\frac{1 - \gamma^5}{2}\right). \tag{5.54}$$

This allows us to write the weak current in the manner:

$$j_\mu^- = \bar\nu_L\gamma_\mu e_L \tag{5.55}$$

which is a purely vectorial vertex, but only couples left-handed $\nu$'s to left-handed $e$'s. We can also accomplish
the same thing in QED (writing the current in terms of chiral fermion states) even if the vertex is already vectorial
there:

$$u = \left(\frac{1 - \gamma^5}{2}\right)u + \left(\frac{1 + \gamma^5}{2}\right)u = u_L + u_R, \tag{5.56}$$

so that for instance

$$j_\mu^{QED} = -\bar e\gamma_\mu e = -\bar e_L\gamma_\mu e_L - \bar e_R\gamma_\mu e_R. \tag{5.57}$$

Cross-terms can be shown to vanish, and we built in a factor $(-1)$ due to the negative electric charge.

The main merit of the above formulation is then that the actual vertex may be turned into a purely vectorial one for
both QED and the weak interactions by allowing the $(1 - \gamma^5)$ factor to characterize the particles instead. Again,
we emphasize that the "L" and "R" notation only represents true handedness for mass $m = 0$ (and approximately
for $E \gg mc^2$). In general, it should be viewed as convenient notation.
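The algebraic identities used for the chiral projectors can be verified with explicit matrices. The sketch below builds the $\gamma$-matrices in the Dirac representation, assumed here because it is the one in which $\gamma^5$ has the off-diagonal form used in Eq. (5.48), and checks that $P_L = (1 - \gamma^5)/2$ is a projector and that $\gamma^\mu P_L = P_R\gamma^\mu$. The helper functions are our own and only the standard library is used.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def combine(a, b, ca=1, cb=1):
    """Linear combination ca*a + cb*b."""
    return [[ca * x + cb * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def close(a, b, tol=1e-12):
    """Element-wise comparison of two matrices."""
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def neg(m):
    return [[-x for x in row] for row in m]


def block(tl, tr, bl, br):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return [tl[i] + tr[i] for i in range(2)] + [bl[i] + br[i] for i in range(2)]


I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

# Dirac representation: gamma^0 = diag(1, -1), gamma^i = [[0, s_i], [-s_i, 0]].
GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in SIGMA]
GAMMA5 = block(Z2, I2, I2, Z2)
I4 = block(I2, Z2, Z2, I2)
ZERO4 = combine(I4, I4, 1, -1)

P_L = combine(I4, GAMMA5, 0.5, -0.5)  # (1 - gamma^5)/2
P_R = combine(I4, GAMMA5, 0.5, 0.5)   # (1 + gamma^5)/2

if __name__ == "__main__":
    print("P_L^2 = P_L:", close(matmul(P_L, P_L), P_L))
    print("P_L P_R = 0:", close(matmul(P_L, P_R), ZERO4))
    print("gamma P_L = P_R gamma:",
          all(close(matmul(g, P_L), matmul(P_R, g)) for g in GAMMA))
```

These three properties are exactly what is needed to go from Eq. (5.53) to Eq. (5.55), and the vanishing of $P_L P_R$ is what removes the cross-terms in Eq. (5.57).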


Weak isospin and hypercharge. 

The negatively charged weak current and positive equivalent look as follows. 
They may be expressed compactly by introducing the notation

$$\chi_L = \begin{pmatrix}\nu_e\\ e\end{pmatrix}_L, \qquad \tau^+ = \begin{pmatrix}0 & 1\\ 0 & 0\end{pmatrix}, \qquad \tau^- = \begin{pmatrix}0 & 0\\ 1 & 0\end{pmatrix} \tag{5.58}$$

so that we may write:

$$j_\mu^\pm = \bar\chi_L\gamma_\mu\tau^\pm\chi_L. \tag{5.59}$$

We see that $\tau^\pm = (\tau^1 \pm i\tau^2)/2$ where $\tau^{1,2}$ are the first two Pauli matrices. This is similar to our earlier treatment of
isospin. In fact, we could envision a full "weak isospin" SU(2) symmetry (meaning that the Lagrangian describing
these interactions would be invariant under an SU(2) transformation of the fermion spinors: more on this in the
final chapter) if a third current existed, namely

$$j_\mu^3 = \bar\chi_L\gamma_\mu\tfrac{1}{2}\tau^3\chi_L. \tag{5.60}$$

This is precisely the neutral weak current, which we already do know exists. But what about right-handed couplings
via $\gamma$, which should also be allowed? To incorporate these couplings and the electromagnetic current into this
unified framework, we introduce the weak hypercharge current

$$j_\mu^Y = 2j_\mu^{QED} - 2j_\mu^3 \tag{5.61}$$

which includes right-handed electrons [to see this, revisit the definition of $j_\mu^{QED}$ in Eq. (5.57)]. The quantity
$Y$ equals $2Q - 2I_3$ where $Q$ is the electric charge in units of $e$ while $I_3$ is the third component of the weak
isospin. Note that $j_\mu^Y$ is invariant with respect to rotations in weak isospin space (such as $e_L \leftrightarrow \nu_L$), since such
transformations do not influence right-handed states.


Note that we are here considering the original leptonic version of the electroweak model, where the right-handed 
neutrino did not couple to any gauge fields as it was thought to not exist, and could thus simply be dropped. 
However, today we know that neutrinos have a small mass (as discussed in our earlier treatment of neutrino 
oscillations), so that the right-handed neutrino should in fact be kept. 


The total underlying symmetry group of the (original) electroweak theory is $SU(2)_L \otimes U(1)$, where $U(1)$ is associated
with the weak hypercharge quantum number. The latter involves both chiralities. The meaning of the above
statement regarding the symmetry group is that the Lagrangian describing these interactions is invariant under both
SU(2) transformations (in the left-handed sector) and U(1) transformations of the fermion spinors. This formalism
can also be extended to other lepton/quark doublets, such as

$$\chi_L \to \begin{pmatrix}\nu_e\\ e\end{pmatrix}_L, \quad \begin{pmatrix}u\\ d'\end{pmatrix}_L, \quad \begin{pmatrix}c\\ s'\end{pmatrix}_L \tag{5.62}$$

and so forth. We then formally define three weak isospin currents $\mathbf{j}_\mu$ and a weak hypercharge current $j_\mu^Y$:

$$\mathbf{j}_\mu = \tfrac{1}{2}\bar\chi_L\gamma_\mu\boldsymbol{\tau}\chi_L, \qquad j_\mu^Y = 2j_\mu^{QED} - 2j_\mu^3, \tag{5.63}$$

where

$$j_\mu^{QED} = \sum_{i=1}^{2} Q_i\left(\bar u_{iL}\gamma_\mu u_{iL} + \bar u_{iR}\gamma_\mu u_{iR}\right) \tag{5.64}$$

and the summation goes over particles $i$ in the doublet.
E. Electroweak mixing

The GWS model states that the weak isospin currents couple with strength $g_w$ to a weak isotriplet of intermediate
vector bosons $\mathbf{W}^\mu$ (a vector in weak isospin space), whereas $j_\mu^Y$ couples with strength $g'/2$ to an isosinglet
intermediate vector boson $B^\mu$. This may be mathematically expressed via the following structure, which could be
included in a Lagrangian describing such interactions:

$$-i\left[g_w\,\mathbf{j}_\mu\cdot\mathbf{W}^\mu + \frac{g'}{2}j_\mu^Y B^\mu\right]. \tag{5.65}$$

We shall in fact examine Lagrangians describing particle interactions in much more detail in the last chapter of
this book. Eq. (5.65) contains all the electrodynamic and weak interactions. We may express the first term via the
charged currents:

$$\mathbf{j}_\mu\cdot\mathbf{W}^\mu = \frac{1}{\sqrt{2}}j_\mu^+ W^{\mu+} + \frac{1}{\sqrt{2}}j_\mu^- W^{\mu-} + j_\mu^3 W^{\mu 3} \tag{5.66}$$

where $W_\mu^\pm = (W_\mu^1 \mp iW_\mu^2)/\sqrt{2}$ are the "wavefunctions" for the $W^\pm$ particles. The precise couplings to the $W^\pm$
particles can now be read out from the coefficients of Eq. (5.65). Let us look at this in more detail.


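Example 17. Coupling to intermediate vector bosons. Consider $e^- \to \nu_e + W^-$. This process is described by
the negative weak current:

$$j_\mu^- = \bar\nu_L\gamma_\mu e_L = \bar\nu\gamma_\mu\left(\frac{1 - \gamma^5}{2}\right)e. \tag{5.67}$$

Inserted into Eq. (5.65), we obtain that

$$-ig_w\frac{1}{\sqrt{2}}j_\mu^- W^{\mu-} = -\frac{ig_w}{2\sqrt{2}}\left[\bar\nu\gamma_\mu(1 - \gamma^5)e\right]W^{\mu-}. \tag{5.68}$$

Thus, we recover the correct vertex factor $-ig_w/(2\sqrt{2})\,\gamma_\mu(1 - \gamma^5)$.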


What happens in electroweak theory is that the $SU(2)_L \otimes U(1)$ symmetry is reduced to a U(1) symmetry (but
not in the hypercharge sector, although this is not crucial for this argument) via the Higgs mechanism that occurs
through spontaneous symmetry breaking. We shall have much more to say about both of these phenomena in the
next chapter, but for now we only need to know that the implication of this is that the two neutral intermediate
vector bosons in our theory so far ($W^3$ and $B$) mix and produce one massless linear combination (the photon) and
one massive combination (the $Z^0$) according to:

$$A_\mu = B_\mu\cos\theta_w + W_\mu^3\sin\theta_w,$$

$$Z_\mu = -B_\mu\sin\theta_w + W_\mu^3\cos\theta_w. \tag{5.69}$$

If we write the neutral portion of the electroweak interaction with the physical $A_\mu$ and $Z_\mu$ states, we obtain

$$-i\left[g_w j_\mu^3 W^{\mu 3} + \frac{g'}{2}j_\mu^Y B^\mu\right] = -i\left\{\left[g_w\sin\theta_w\, j_\mu^3 + \frac{g'}{2}\cos\theta_w\, j_\mu^Y\right]A^\mu + \left[g_w\cos\theta_w\, j_\mu^3 - \frac{g'}{2}\sin\theta_w\, j_\mu^Y\right]Z^\mu\right\} \tag{5.70}$$

so that the previous symmetry in isospin space [SU(2)], made evident by the coupling to the Pauli matrix vector
$\boldsymbol{\tau}$, is now gone. Instead, by comparing this with the known electromagnetic coupling $-ig_e j_\mu^{QED}A^\mu$, and using that
$j_\mu^{QED} = j_\mu^3 + \frac{1}{2}j_\mu^Y$, we obtain consistency only if

$$g_w\sin\theta_w = g'\cos\theta_w = g_e. \tag{5.71}$$

This is the origin of the dependence of the interaction parameters on the mixing angle $\theta_w$, and shows that
the weak and electromagnetic coupling constants are not independent. A similar procedure for $Z^0$ yields
$g_z = g_e/(\sin\theta_w\cos\theta_w)$ as previously announced. We can also read out the vector and axial couplings $\{c_V, c_A\}$
for neutral weak processes, as we show in the example below.
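To see what Eq. (5.71) implies numerically, the following sketch computes the three couplings from the fine-structure constant and the mixing angle. It assumes the dimensionless convention $g_e = \sqrt{4\pi\alpha}$ with $\alpha \approx 1/137.036$, analogous to the relation between $g_s$ and $\alpha_s$ used for the strong coupling; the function name is our own.

```python
import math

ALPHA = 1 / 137.036  # fine-structure constant (assumed input value)


def electroweak_couplings(theta_w_deg=28.7, alpha=ALPHA):
    """Return (g_e, g_w, g_prime, g_z) from Eq. (5.71) and the relation for g_z."""
    theta = math.radians(theta_w_deg)
    g_e = math.sqrt(4 * math.pi * alpha)  # assumed convention g_e = sqrt(4 pi alpha)
    g_w = g_e / math.sin(theta)
    g_prime = g_e / math.cos(theta)
    g_z = g_e / (math.sin(theta) * math.cos(theta))
    return g_e, g_w, g_prime, g_z


if __name__ == "__main__":
    g_e, g_w, g_prime, g_z = electroweak_couplings()
    print(f"g_e = {g_e:.3f}, g_w = {g_w:.3f}, g' = {g_prime:.3f}, g_z = {g_z:.3f}")
```

Under these assumptions the weak couplings come out larger than the electromagnetic one, so the weakness of the weak interaction at low energies is due to the large masses of $W^\pm$ and $Z^0$ in the propagators rather than to a small coupling constant.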


Example 18. Reading out coupling constants. Consider the process $\nu_e \to \nu_e + Z^0$. We can only have a
contribution from $j_\mu^3$ and not $j_\mu^{QED}$ since neutrinos only interact weakly. The total coupling to $Z_\mu$ can therefore be
written as:

$$-ig_z\Big(j_\mu^3 - \sin^2\theta_w\underbrace{j_\mu^{QED}}_{=0}\Big)Z^\mu. \tag{5.72}$$

By inserting $j_\mu^3 = \frac{1}{2}\bar\nu_L\gamma_\mu\nu_L$ we obtain from the above equation

$$-\frac{ig_z}{2}\left[\bar\nu\gamma_\mu\left(\frac{1 - \gamma^5}{2}\right)\nu\right]Z^\mu. \tag{5.73}$$

From this expression, we then read out that $c_V^\nu = c_A^\nu = \frac{1}{2}$ by comparing with Eq. (5.29).


Although we have made a lot of progress here in terms of constructing a unified framework for weak and electromagnetic
interactions, we still have not provided a detailed explanation of an important step of this task, namely
how the gauge fields acquire mass. This will be remedied in the last chapter in this book which treats gauge theories
and spontaneous symmetry breaking.
VI. QUANTUM CHROMODYNAMICS

Learning goals. After reading this chapter, the student should: 

• Be able to compute the Feynman amplitude M for Feynman diagrams in quantum chromodynamics, and 
also be able to obtain decay rates and scattering cross sections from M . 

• Understand how to account for the composite structure of protons and neutrons when treating scattering 
processes via Feynman diagrams. 

• Understand how the color quantum number affects particles in strong interactions and how it is accounted 
for quantitatively. 


All that we have stated so far about electrons in QED applies to quarks as well after replacing the charge $(-e)$ with either
$2e/3$ or $(-e/3)$, depending on which quark we are considering. The problem which complicates our observation
of how quarks behave is that they, in contrast to electrons, never appear freely: we must infer information indirectly
through hadrons. We therefore start by considering two central examples: hadron production via $e^- - e^+$ collisions and
$e - p$ scattering. Then, we proceed to develop the framework of QCD: the theory of how colored particles interact.
We discuss Feynman rules, color factors, pair annihilation, and finally asymptotic freedom.

A. Hadron production via $e^- - e^+$ collisions

Consider the process $e^- + e^+ \to q + \bar q$.

[Figure: Feynman diagram for $e^- + e^+ \to q + \bar q$.]

Briefly, the quarks escape as "free particles", but when their separation reaches $\sim 10^{-15}$ m the strong interaction
is so great that it triggers a cascade of quark-antiquark pairs. The result is jet formation of hadrons.

[Figure: hadron jets emerging from the collision and registered in the detector.]

What is observed is then $e^- + e^+ \to$ hadrons. However, the fingerprint of quarks in this process is that the hadrons
typically emerge in two back-to-back jets along the direction of $q$ and $\bar q$, respectively. One also experimentally
observes three-jet events, which is indicative of an emitted gluon carrying a portion of the energy. The observation
of three-jet events is generally regarded as the most direct evidence for the existence of gluons.


Note that the first stage in the hadronization and jet formation ($e^- + e^+ \to \gamma \to q + \bar q$) is ordinary QED, so we
obtain (evaluated in the CM frame):

$$\langle|\mathcal{M}|^2\rangle = Q^2 g_e^4\left\{1 + \left(\frac{mc^2}{E}\right)^2 + \left(\frac{Mc^2}{E}\right)^2 + \left[1 - \left(\frac{mc^2}{E}\right)^2\right]\left[1 - \left(\frac{Mc^2}{E}\right)^2\right]\cos^2\theta\right\} \tag{6.1}$$

which leads to a total scattering cross section

$$\sigma = \frac{\pi Q^2}{3}\left(\frac{\hbar c\alpha}{E}\right)^2\sqrt{\frac{1 - (Mc^2/E)^2}{1 - (mc^2/E)^2}}\left[1 + \frac{1}{2}\left(\frac{Mc^2}{E}\right)^2\right]\left[1 + \frac{1}{2}\left(\frac{mc^2}{E}\right)^2\right]. \tag{6.2}$$

Here, $Q$ is the quark charge in units of $e$ (2/3 for $u, c, t$, and so forth), $M$ is the mass of the quark, and $m$ is the mass of the electron.

An interesting point is that $\sigma$ becomes imaginary for $E < Mc^2$. Take a moment to reflect on why this happens -
clearly, it must be nonphysical. The reason is that the process becomes kinematically forbidden for energies below
$Mc^2$: there is not enough energy available to even produce the rest masses of the quarks. On the other hand, for
high energies $E > Mc^2 \gg mc^2$, we obtain the simple expression

$$\sigma = \frac{\pi}{3}\left(\frac{\hbar Q c\alpha}{E}\right)^2. \tag{6.3}$$

There exist a number of thresholds for the energy as we increase it: first one is able to produce muons and light
quarks, then (at ~ 1300 MeV) the $c$ quark, $\tau$ at 1777 MeV, the $b$ quark at 4500 MeV, and finally the $t$ quark. A
prediction for experiments would then be to consider the ratio

$$R = \frac{\sigma(e^- e^+ \to \text{hadrons})}{\sigma(e^- e^+ \to \mu^- \mu^+)}. \tag{6.4}$$

Above each threshold, we then have according to our expression derived for $\sigma$ above: $R(E) = 3\sum_i Q_i^2$ where the
sum is over all quark flavors with threshold below $E$. The factor 3 is due to the three possible colors for each flavor.
According to this, we should have what resembles a "staircase" graph with a step up every time a new threshold is
exceeded.
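The heights of the steps in this staircase follow directly from the quark charges. The short sketch below evaluates $R = 3\sum_i Q_i^2$ with exact fractions for the successive sets of accessible flavors; the dictionary and function name are our own.

```python
from fractions import Fraction

# Quark charges in units of e.
QUARK_CHARGES = {
    "u": Fraction(2, 3), "d": Fraction(-1, 3), "s": Fraction(-1, 3),
    "c": Fraction(2, 3), "b": Fraction(-1, 3), "t": Fraction(2, 3),
}


def r_ratio(open_flavors, n_colors=3):
    """R = n_colors * sum of squared charges over the accessible flavors."""
    return n_colors * sum(QUARK_CHARGES[q] ** 2 for q in open_flavors)


if __name__ == "__main__":
    for flavors in ("uds", "udsc", "udscb", "udscbt"):
        print(f"{flavors:7s} R = {r_ratio(flavors)}")
```

Setting `n_colors=1` shows what the prediction would be without the color quantum number: every step would be a factor of 3 too low.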


[Figure: staircase-like graph of $R$ versus $E$, with plateaus for $(u, d, s)$, $c$, and $b$.]

Comparison between experiment and theory is pretty good, except right next to the actual thresholds. One thing
missed by our approximations is that the $q\bar q$ pairs are not truly free particles, although we have described them as
if they were. Therefore, the hadronization process cannot in reality be split into two artificial steps $e^- e^+ \to q\bar q$
and then $q\bar q \to$ hadrons. For instance, it is possible to produce a bound state (such as $\phi = s\bar s$ or $\psi = c\bar c$)
where the quarks are strongly interacting and our procedure fails. Such events show up as resonant peaks in $\sigma$.
Nevertheless, it is noteworthy that the color factor of 3 is crucial for agreement between experiment and theory,
and thus constitutes compelling evidence of the color quantum number.


B. Elastic $e - p$ scattering

Let us now see how we can probe the internal structure of the proton. If $p$ had no internal structure, we should be
able to treat it as a point-particle and copy/paste our QED treatment of $e - \mu$ scattering. However, since $p$ is not
a simple point charge, we need a more flexible formalism to account for such scattering. To lowest order in QED,
we may represent the process as follows.

[Figure: Feynman diagram for elastic $e - p$ scattering, with a blob at the photon-proton vertex.]

The dark blob represents the unknown exact structure of the photon-proton vertex interaction. What we do know,
however, is that the $e - e$ vertex and $\gamma$ propagator are the same as in QED. Therefore, we can write generally:

$$\langle|\mathcal{M}|^2\rangle = \frac{g_e^4}{q^4}L^{\mu\nu}_{\text{electron}}K_{\mu\nu}^{\text{proton}}, \tag{6.5}$$

where the tensor $K_{\mu\nu}$ is, so far, unknown. Since $K_{\mu\nu}$ is a second-rank tensor, it can only depend on three quantities:
$p_2$, $p_4$ and $q$. But as $q = p_4 - p_2$, only two of these are independent. We choose $q$ and $p_2$ and drop the subscript so
that $p = p_2$ is the initial proton momentum. With this in mind, the most general form of $K^{\mu\nu}$ reads:


$$K^{\mu\nu} = -K_1 g^{\mu\nu} + \frac{K_2}{(Mc)^2}p^\mu p^\nu + \frac{K_4}{(Mc)^2}q^\mu q^\nu + \frac{K_5}{(Mc)^2}\left(p^\mu q^\nu + p^\nu q^\mu\right). \tag{6.6}$$

In this way, all scalars $K_i$ have the same dimension and are functions of the only scalar variable in the problem:
$q^2$. Note that $p^2 = M^2c^2$ and $p_4^2 = M^2c^2 = (q + p)^2$, so that $q\cdot p = -q^2/2$. The $K_i$ functions are nevertheless not
independent of each other. One may show that $q_\mu K^{\mu\nu} = 0$ which in turn leads to

$$K_4 = (Mc)^2K_1/q^2 + K_2/4, \qquad K_5 = K_2/2. \tag{6.7}$$

We have then managed to express $K^{\mu\nu}$ with two so-called form factors $K_1 = K_1(q^2)$ and $K_2 = K_2(q^2)$:

$$K^{\mu\nu} = K_1\left(-g^{\mu\nu} + \frac{q^\mu q^\nu}{q^2}\right) + \frac{K_2}{(Mc)^2}\left(p^\mu + q^\mu/2\right)\left(p^\nu + q^\nu/2\right). \tag{6.8}$$

With this in hand, we may now evaluate $\langle|\mathcal{M}|^2\rangle$ in the standard way:

$$\langle|\mathcal{M}|^2\rangle = \left(\frac{2g_e^2}{q^2}\right)^2\left\{K_1\left[(p_1\cdot p_3) - 2(mc)^2\right] + K_2\left[\frac{(p_1\cdot p)(p_3\cdot p)}{(Mc)^2} + \frac{q^2}{4}\right]\right\}. \tag{6.9}$$

Working in the lab frame with the proton at rest, one obtains

$$\frac{d\sigma}{d\Omega} = \left(\frac{\alpha\hbar}{4ME\sin^2(\theta/2)}\right)^2\frac{E'}{E}\left[2K_1\sin^2(\theta/2) + K_2\cos^2(\theta/2)\right] \tag{6.10}$$


where $E$ is the incident electron energy, $E'$ is the outgoing electron energy, while $\theta$ is the scattering angle of the
electron. The point is then that by experimentally counting the number of electrons scattered into a given direction
for a specific range of energies, one determines $K_1(q^2)$ and $K_2(q^2)$. This gives us a way to model the internal
structure of the proton. However, a complete theory should be able to calculate what $K_1$ and $K_2$ are. In the
simplest model (proton as a point charge), one obtains $K_1 = -q^2$ and $K_2 = (2Mc)^2$. This is fine at low energies
where $e$ never gets close enough to see the inside of the proton. It fails dramatically at high energies, since the
proton has a rich internal structure which comes into play at close enough distances.


C. Feynman rules for QCD

QCD describes the interaction between colored particles via gluons. The strong coupling constant is $g_s = \sqrt{4\pi\alpha_s}$, so
that $\alpha_s$ sets the strength of the force. Put simply, we may think of $g_s$ as the fundamental unit of color. Now, to
specify a quark state requires both a Dirac spinor $u^{(s)}(p)$ and a three-element column vector $c$ which provides the
color state. We have $c = [1, 0, 0]^T$ for red, $c = [0, 1, 0]^T$ for blue, and $c = [0, 0, 1]^T$ for green. Let $c_i$, $i = 1, 2, 3$, run
over quark color indices. We can then have processes such as:

[Figure: a quark-gluon vertex where a red quark turns into a blue quark.]

where a red quark turns into a blue quark and a gluon carries off the missing color. Initially, we may thus expect
nine types of gluons since there are 3 x 3 combinations of color and anticolor ($r\bar r$, $r\bar b$, ...). These states can be
represented as a color octet and a color singlet, which is a compatible representation with regard to the SU(3)
symmetry that QCD is based on (more on this later). Specifically, the octet states read

$$|1\rangle = (r\bar b + b\bar r)/\sqrt{2}, \quad |2\rangle = -i(r\bar b - b\bar r)/\sqrt{2}, \quad |3\rangle = (r\bar r - b\bar b)/\sqrt{2}, \quad |4\rangle = (r\bar g + g\bar r)/\sqrt{2},$$

$$|5\rangle = -i(r\bar g - g\bar r)/\sqrt{2}, \quad |6\rangle = (b\bar g + g\bar b)/\sqrt{2}, \quad |7\rangle = -i(b\bar g - g\bar b)/\sqrt{2}, \quad |8\rangle = (r\bar r + b\bar b - 2g\bar g)/\sqrt{6}, \tag{6.11}$$

while the singlet state is

$$|9\rangle = (r\bar r + b\bar b + g\bar g)/\sqrt{3}. \tag{6.12}$$

All types of gluon combinations can be obtained using these nine states. For instance, an $r\bar b$ gluon is obtained by
$(|1\rangle + i|2\rangle)/\sqrt{2}$. Note that the state $|9\rangle$ is invariant under SU(3) transformations. That is why it is called a singlet,
analogously to an $S_z = 0$, $S = 0$ spin-singlet state which is invariant under SU(2) spin rotations. We must now
make a minor, but important modification compared to what we have stated initially in this book:


All naturally occurring particles are color singlets.
A color singlet is not the same as what is meant by a colorless particle. For instance, $|3\rangle$ and $|8\rangle$ are colorless in
the sense that they have equally much of color and the corresponding anticolor. However, they are not singlets.
To see this, consider an analogy from spin. $S_z = 0$ does not imply $S = 0$: for instance, the $m = 0$ triplet state
$(\uparrow\downarrow + \downarrow\uparrow)/\sqrt{2}$ has $S_z = 0$, but $S = 1$. On the other hand, $S = 0$ does necessarily imply $S_z = 0$. One can think
of $|9\rangle$ as an invariant under color rotations in the same way as $r^2 = x^2 + y^2 + z^2$ is invariant under spatial rotations.

Therefore, octet gluons do not appear as free particles, because they are not singlets. But if $|9\rangle$ existed, it should
appear not only as a mediator but also as a free particle. Since gluons are massless, the existence of a free singlet
gluon would imply that a long-ranged strong force exists. However, this has never been observed and we may
conclude that only eight gluons exist. Strictly speaking, there are additional details regarding the gluon mass
in relation to the so-called trace anomaly in QCD, which we do not go into here. The conclusion nevertheless
remains that a free singlet gluon has not been experimentally observed.


In order to quantitatively describe gluons, let us start by noting that they are massless spin-1 particles, so that we
can set the polarization vector $\epsilon^\mu$ orthogonal to the momentum $p^\mu$. Let us, as before, use the Coulomb gauge:
$\epsilon^0 = 0$, $\boldsymbol{\epsilon}\cdot\mathbf{p} = 0$. Note that selecting a particular gauge spoils Lorentz covariance: doing a Lorentz transformation
now demands that we also do an accompanying gauge transformation to restore the Coulomb gauge. As always, the
gauge choice does not ultimately affect the physics: it is just a matter of mathematical convenience.

To describe the color of the gluon, we use an eight-element vector $a$:

$$a = [1,0,0,0,0,0,0,0]^T \text{ for } |1\rangle, \quad a = [0,0,0,0,0,0,1,0]^T \text{ for } |7\rangle, \text{ and so forth.} \tag{6.13}$$

Gluons couple to each other since they carry color (unlike photons that do not carry charge).




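Before stating the Feynman rules for QCD, we need to establish some notation (precisely as we also did for QED
before stating the rules). The first thing is to introduce the Gell-Mann matrices. These are to SU(3) what the Pauli
matrices are to SU(2):

$$\lambda^1 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
\lambda^2 = \begin{pmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
\lambda^3 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
\lambda^4 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix},$$

$$\lambda^5 = \begin{pmatrix} 0 & 0 & -i \\ 0 & 0 & 0 \\ i & 0 & 0 \end{pmatrix}, \quad
\lambda^6 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}, \quad
\lambda^7 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -i \\ 0 & i & 0 \end{pmatrix}, \quad
\lambda^8 = \frac{1}{\sqrt{3}} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -2 \end{pmatrix}.$$ (6.14)
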


We also need the commutators of the $\lambda$-matrices, which define the structure constants $f^{\alpha\beta\gamma}$ of SU(3) via the relation:

$$[\lambda^\alpha, \lambda^\beta] = 2i f^{\alpha\beta\gamma} \lambda^\gamma$$ (6.15)


where summation over $\gamma$ from 1 to 8 is implied due to the repeated index. The structure constants are antisymmetric,
so that $f^{\alpha\beta\gamma} = -f^{\alpha\gamma\beta} = -f^{\beta\alpha\gamma}$. With notation in hand, we can now write down the Feynman rules of QCD.
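As a quick numerical cross-check of Eq. (6.15), the structure constants can be extracted from the Gell-Mann matrices themselves. The sketch below is only illustrative: it assumes the standard normalization $\mathrm{Tr}(\lambda^\alpha\lambda^\beta) = 2\delta^{\alpha\beta}$ (not stated above, but satisfied by the matrices as listed), so that $\mathrm{Tr}([\lambda^\alpha, \lambda^\beta]\lambda^\gamma) = 4i f^{\alpha\beta\gamma}$. The helper names `matmul` and `structure_constant` are hypothetical and chosen here for illustration.

```python
# Gell-Mann matrices lambda^1..lambda^8 as nested lists (rows) of numbers.
SQRT3 = 3 ** 0.5
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[1 / SQRT3, 0, 0], [0, 1 / SQRT3, 0], [0, 0, -2 / SQRT3]],
]


def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def structure_constant(a, b, c):
    """f^{abc} for indices 1..8, assuming Tr([l^a, l^b] l^c) = 4i f^{abc}."""
    la, lb, lc = LAMBDA[a - 1], LAMBDA[b - 1], LAMBDA[c - 1]
    ab, ba = matmul(la, lb), matmul(lb, la)
    comm = [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]
    prod = matmul(comm, lc)
    trace = sum(prod[i][i] for i in range(3))
    return (trace / 4j).real


if __name__ == "__main__":
    print(structure_constant(1, 2, 3), structure_constant(2, 1, 3))
```

Swapping any two indices flips the sign of the result, which is the antisymmetry quoted above.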


External lines. A quark or antiquark with momentum $p$, spin $s$, and color $c$ is represented as follows.

            Quark                              Antiquark
Incoming    $u^{(s)}(p)\,c$                    $\bar{v}^{(s)}(p)\,c^\dagger$
Outgoing    $\bar{u}^{(s)}(p)\,c^\dagger$      $v^{(s)}(p)\,c$

A gluon with momentum $p$, polarization $\epsilon$, and color $a$ is written as follows.

Incoming    $\epsilon_\mu(p)\,a^\alpha$
Outgoing    $\epsilon^*_\mu(p)\,a^{\alpha *}$

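Propagators.

Quarks & Antiquarks (momentum $q$):   $\dfrac{i(\gamma^\mu q_\mu + mc)}{q^2 - m^2c^2}$

Gluons (momentum $q$, indices $\alpha,\mu$ and $\beta,\nu$):   $\dfrac{-i g_{\mu\nu}\delta^{\alpha\beta}}{q^2}$

Vertices. Each vertex introduces a factor as follows.

Quark-gluon:   $-\dfrac{i g_s}{2}\lambda^\alpha\gamma^\mu$

Three-gluon:   $-g_s f^{\alpha\beta\gamma}\left[g_{\mu\nu}(k_1 - k_2)_\lambda + g_{\nu\lambda}(k_2 - k_3)_\mu + g_{\lambda\mu}(k_3 - k_1)_\nu\right]$
(notice how gluon momenta point into the vertex)

Four-gluon:   $-i g_s^2\left[f^{\alpha\beta\eta}f^{\gamma\delta\eta}(g_{\mu\lambda}g_{\nu\rho} - g_{\mu\rho}g_{\nu\lambda}) + f^{\alpha\delta\eta}f^{\beta\gamma\eta}(g_{\mu\nu}g_{\lambda\rho} - g_{\mu\lambda}g_{\nu\rho}) + f^{\alpha\gamma\eta}f^{\delta\beta\eta}(g_{\mu\rho}g_{\nu\lambda} - g_{\mu\nu}g_{\lambda\rho})\right]$
(sum over $\eta$ implied)

Apart from these rules, the same applies as in QED. In what follows, we shall have a look at some concrete
examples of this framework in action.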


D. Color factors 


Consider the interaction between two quarks to lowest order in QCD. The focus will be on the effective
potential between quarks, analogous to the Coulomb potential in QED. Keep in mind that we are
effectively doing a perturbation theory calculation, assuming that $\alpha_s$ is small. Because of this, however, we cannot
hope to obtain the result of asymptotic freedom at lowest order, since we should only be able to describe the
short-range behavior of the potential, where the strong interaction is much weaker than at large separation
distances between the quarks. We will discover a central result (to be shown quantitatively below):

Quarks attract most strongly when they are in a color singlet configuration. 

Quark & Antiquark.

Consider then $q\bar{q}$ scattering with different flavors (such as $u + \bar{d} \to u + \bar{d}$). The lowest-order diagram looks as
follows (the small arrows emphasize the direction of momentum flow).



The amplitude is obtained according to the rules:

$$\mathcal{M} = i\,[\bar{u}(3)c_3^\dagger]\left[-i\frac{g_s}{2}\lambda^\alpha\gamma^\mu\right][u(1)c_1]\left[\frac{-i g_{\mu\nu}\delta^{\alpha\beta}}{q^2}\right][\bar{v}(2)c_2^\dagger]\left[-i\frac{g_s}{2}\lambda^\beta\gamma^\nu\right][v(4)c_4]$$
$$= -\frac{g_s^2}{4q^2}[\bar{u}(3)\gamma^\mu u(1)][\bar{v}(2)\gamma_\mu v(4)](c_3^\dagger\lambda^\alpha c_1)(c_2^\dagger\lambda^\alpha c_4).$$ (6.16)

Now, the structure is identical to $e^-e^+$ scattering except for (1) $g_e \to g_s$ and (2) the additional color factor
$f = \frac{1}{4}(c_3^\dagger\lambda^\alpha c_1)(c_2^\dagger\lambda^\alpha c_4)$. Thus, we can state that the potential describing $q\bar{q}$ interactions is the same as in QED for
opposite charges when replacing $\alpha$ with $f\alpha_s$, so that we can at least phenomenologically introduce the potential:

$$V_{q\bar{q}}(r) = -f\,\frac{\alpha_s\hbar c}{r}$$ (6.17)


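The factor $f$ depends on the color state of the interacting quarks: we can create a singlet state or one of the color
octet states. In the latter case, all octet states actually turn out to yield the same $f$. Let us compute this one to begin with and
choose the combination $r\bar{b}$ to be concrete. Color is conserved overall, and thus the incoming $q$ has the same color
as the outgoing $q$, namely red. Likewise, the incoming and outgoing $\bar{q}$ both have antiblue color. Therefore,

$$c_1 = c_3 = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \qquad c_2 = c_4 = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}.$$ (6.18)

In turn, this produces the following color factor:

$$f = \frac{1}{4}\left([1,0,0]\lambda^\alpha[1,0,0]^T\right)\left([0,1,0]\lambda^\alpha[0,1,0]^T\right) = \frac{1}{4}\lambda^\alpha_{11}\lambda^\alpha_{22}.$$ (6.19)

Only the $\lambda^3$ and $\lambda^8$ matrices have non-zero entries in the (11) and (22) positions, so that
$f = \frac{1}{4}(\lambda^3_{11}\lambda^3_{22} + \lambda^8_{11}\lambda^8_{22}) = \frac{1}{4}(-1 + \frac{1}{3}) = -\frac{1}{6}$.

We can now repeat the same procedure for the singlet state $\frac{1}{\sqrt{3}}(r\bar{r} + b\bar{b} + g\bar{g})$, which gives $f = \frac{4}{3}$. We have then
obtained:

$$V_{q\bar{q}}(r) = \begin{cases} -\dfrac{4}{3}\dfrac{\alpha_s\hbar c}{r} & \text{for color singlet} \\[2mm] +\dfrac{1}{6}\dfrac{\alpha_s\hbar c}{r} & \text{for color octet.} \end{cases}$$ (6.20)

It is clear that the force is attractive for the singlet state and repulsive for the octet, which helps to explain why
$q\bar{q}$ bound states (mesons) occur as color singlets but not octets (which would have produced colored mesons).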
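The two color factors quoted above can be reproduced numerically. The following is a minimal sketch under stated assumptions: the colors are ordered (r, b, g) as in Eq. (6.18), each color state is written as a list of (amplitude, quark color, antiquark color) with real amplitudes, and the function names `bilinear` and `color_factor_qqbar` are hypothetical helpers introduced only for this illustration.

```python
# Gell-Mann matrices lambda^1..lambda^8 as nested lists (rows) of numbers.
SQRT3 = 3 ** 0.5
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[1 / SQRT3, 0, 0], [0, 1 / SQRT3, 0], [0, 0, -2 / SQRT3]],
]

# Basis color vectors, ordered (red, blue, green) as in the text.
R, B, G = (1, 0, 0), (0, 1, 0), (0, 0, 1)


def bilinear(c_out, lam, c_in):
    """c_out^dagger . lam . c_in for real basis color vectors."""
    return sum(c_out[i] * lam[i][j] * c_in[j]
               for i in range(3) for j in range(3))


def color_factor_qqbar(state_in, state_out):
    """f = (1/4) (c3^dag l^a c1)(c2^dag l^a c4), summed over a and over
    the components of the incoming and outgoing color states.
    Amplitudes are assumed real, so no complex conjugation is applied."""
    total = 0
    for amp_in, c1, c2 in state_in:
        for amp_out, c3, c4 in state_out:
            for lam in LAMBDA:
                total += (amp_in * amp_out
                          * bilinear(c3, lam, c1) * bilinear(c2, lam, c4))
    return (total / 4).real


octet = [(1.0, R, B)]  # the r-bbar member of the octet
singlet = [(1 / SQRT3, R, R), (1 / SQRT3, B, B), (1 / SQRT3, G, G)]

if __name__ == "__main__":
    print(color_factor_qqbar(octet, octet))
    print(color_factor_qqbar(singlet, singlet))
```

With $V_{q\bar{q}} = -f\alpha_s\hbar c/r$, a positive $f$ corresponds to attraction and a negative $f$ to repulsion.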


Quark & quark.

Consider again different flavors (e.g. $u + d \to u + d$). The lowest-order amplitude for such scattering is:

$$\mathcal{M} = -\frac{g_s^2}{4q^2}[\bar{u}(3)\gamma^\mu u(1)][\bar{u}(4)\gamma_\mu u(2)](c_3^\dagger\lambda^\alpha c_1)(c_4^\dagger\lambda^\alpha c_2).$$ (6.21)

The procedure is then similar to $e^-\mu^-$ scattering except for (1) $g_e \to g_s$ and (2) the additional color factor
$f = \frac{1}{4}(c_3^\dagger\lambda^\alpha c_1)(c_4^\dagger\lambda^\alpha c_2)$. The effective potential becomes formally equivalent to that of like charges in electrodynamics:

$$V_{qq}(r) = f\,\frac{\alpha_s\hbar c}{r}$$ (6.22)


However, the color configuration for two quarks cannot be a singlet. Instead, we have

Triplet (antisymmetric):   $(rb - br)/\sqrt{2}$,   $(bg - gb)/\sqrt{2}$,   $(gr - rg)/\sqrt{2}$   (6.23)

Sextet (symmetric):   $rr$,   $bb$,   $gg$,   $(rb + br)/\sqrt{2}$,   $(bg + gb)/\sqrt{2}$,   $(gr + rg)/\sqrt{2}$   (6.24)


Performing the calculation as before for the color factors, we find

$$V_{qq}(r) = -\frac{2}{3}\frac{\alpha_s\hbar c}{r} \text{ for the triplet state}, \qquad V_{qq}(r) = \frac{1}{3}\frac{\alpha_s\hbar c}{r} \text{ for the sextet state}.$$ (6.25)

The difference in sign is evident, but at the same time we know that neither configuration occurs in nature in spite of the attractive
interaction for the triplet state. To understand this, we must realize that the short-range attraction does not
prove that binding occurs: for that, we would have to know about the long-range behavior for the same color
configuration. The above result nevertheless has important consequences for the binding of three quarks where
you can show that complete mutual attraction between three quarks is obtained only in a singlet configuration.
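The triplet and sextet color factors can be checked in the same spirit. This is a sketch only: color states are written as lists of (amplitude, color of quark 1, color of quark 2) with real amplitudes, the color ordering (r, b, g) is an assumption carried over from Eq. (6.18), and `color_factor_qq` is a hypothetical helper name.

```python
# Gell-Mann matrices lambda^1..lambda^8 as nested lists (rows) of numbers.
SQRT3 = 3 ** 0.5
SQRT2 = 2 ** 0.5
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[1 / SQRT3, 0, 0], [0, 1 / SQRT3, 0], [0, 0, -2 / SQRT3]],
]

R, B, G = (1, 0, 0), (0, 1, 0), (0, 0, 1)


def bilinear(c_out, lam, c_in):
    """c_out^dagger . lam . c_in for real basis color vectors."""
    return sum(c_out[i] * lam[i][j] * c_in[j]
               for i in range(3) for j in range(3))


def color_factor_qq(state_in, state_out):
    """f = (1/4) (c3^dag l^a c1)(c4^dag l^a c2), summed over a and over
    the components of the two-quark color states (real amplitudes)."""
    total = 0
    for amp_in, c1, c2 in state_in:
        for amp_out, c3, c4 in state_out:
            for lam in LAMBDA:
                total += (amp_in * amp_out
                          * bilinear(c3, lam, c1) * bilinear(c4, lam, c2))
    return (total / 4).real


triplet = [(1 / SQRT2, R, B), (-1 / SQRT2, B, R)]    # (rb - br)/sqrt(2)
sextet_mixed = [(1 / SQRT2, R, B), (1 / SQRT2, B, R)]  # (rb + br)/sqrt(2)
sextet_rr = [(1.0, R, R)]                              # rr

if __name__ == "__main__":
    for name, state in [("triplet", triplet),
                        ("sextet (rb+br)", sextet_mixed),
                        ("sextet rr", sextet_rr)]:
        print(name, color_factor_qq(state, state))
```

Since $V_{qq} = f\alpha_s\hbar c/r$, the negative triplet value corresponds to attraction and the positive sextet value to repulsion, and both sextet members tested give the same $f$.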


E. Briefly on asymptotic freedom 


In the QED chapter of this book, we found that vacuum-polarization loop diagrams cause the effective charge of the electron
to be a function of the momentum transfer $q$:

$$\alpha(|q|^2) = \alpha(0)\left\{1 + \frac{\alpha(0)}{3\pi}\ln\left[\frac{|q|^2}{(mc)^2}\right]\right\}$$ (6.26)

for $|q|^2 \gg (mc)^2$. The physics in this case is that as the charges get closer to each other (larger $|q|^2$), the coupling
strength increases, because the screening due to vacuum polarization is less effective at shorter distances. We recall
that this type of diagram also introduces a divergent term that we "soak up" in an effective charge (which is what
is experimentally measured). Now, the above equation holds to order $\mathcal{O}\{[\alpha(0)]^2\}$. The contribution from
higher-order diagrams containing several such bubbles can be summed explicitly and provides the result:

$$\alpha(|q|^2) = \frac{\alpha(0)}{1 - [\alpha(0)/3\pi]\ln\left[|q|^2/(mc)^2\right]}$$ (6.27)

We can understand why the expression looks like this when taking into account higher-order loop diagrams, since
we obtain a geometric series of the type $1 + x + x^2 + \ldots = \frac{1}{1-x}$, where $x$ represents a bubble.


Now, much the same thing happens in QCD, where $q\bar{q}$ bubbles screen the quark color and give (modulo
color factors $f$) the same result as Eq. (6.26). However, QCD has a twist that is not present in QED. There are also
virtual gluon bubbles due to the gluon-gluon coupling.

The gluon contribution actually works in the opposite direction to the $q\bar{q}$ bubbles and thus produces antiscreening
or "camouflage". The formula for the running coupling constant in QCD reads:

$$\alpha_s(|q|^2) = \frac{\alpha_s(\mu^2)}{1 + [\alpha_s(\mu^2)/12\pi](11n - 2f)\ln(|q|^2/\mu^2)}, \qquad |q|^2 \gg \mu^2.$$ (6.28)

Here, $n$ is the number of colors and $f$ is the number of flavors. In the Standard Model, we know that $n = 3$ and
$f = 6$. As a result, $11n > 2f$ and the coupling constant $\alpha_s$ decreases as $|q|^2$ increases. This means that at short
distances, the strong interaction in fact becomes quite weak in magnitude. An important consequence of
asymptotic freedom is our license to use perturbation theory (Feynman diagrams) for interquark potentials.

But we have not yet specified what $\mu$ is. In electrodynamics it is natural to define the charge of a particle as its
long-range (fully screened) value. We could, however, use the effective charge at any $|q|^2$ as the reference value so
long as $\alpha(|q|^2)$ is small there (so that we are allowed to use Feynman perturbation theory). The problem in QCD
is that $\alpha_s$ is large when $|q|^2 \to 0$, so we cannot use that as our reference point. Instead, we use $\alpha_s(\mu^2) < 1$ as the
"bare" strength of the coupling constant on which we base our perturbation expansion. The quantity $\mu$ is therefore
chosen so that $\alpha_s$ satisfies this. Note that $\alpha_s(|q|^2)$ varies substantially over the experimentally available energy
range whereas $\alpha(|q|^2)$ varies much less.
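To get a feeling for Eqs. (6.27) and (6.28), one can simply evaluate them. The sketch below is illustrative only: the reference value `alpha_mu = 0.3` at `mu2 = 1.0` is a made-up number in arbitrary units (not a measured coupling), and the function names are hypothetical.

```python
import math


def alpha_s(q2, alpha_mu, mu2, n_colors=3, n_flavors=6):
    """Running strong coupling of Eq. (6.28).

    q2 and mu2 are |q|^2 and mu^2 in the same (arbitrary) units, and
    alpha_mu is the reference value alpha_s(mu^2).
    """
    b = (11 * n_colors - 2 * n_flavors) / (12 * math.pi)
    return alpha_mu / (1 + alpha_mu * b * math.log(q2 / mu2))


def alpha_qed(q2, alpha0, m2c2):
    """Summed QED running coupling of Eq. (6.27), for |q|^2 >> (mc)^2."""
    x = (alpha0 / (3 * math.pi)) * math.log(q2 / m2c2)
    return alpha0 / (1 - x)


if __name__ == "__main__":
    # Hypothetical reference point: alpha_s = 0.3 at mu^2 = 1 (arbitrary units).
    for q2 in (1.0, 10.0, 100.0, 1000.0):
        print(q2, alpha_s(q2, alpha_mu=0.3, mu2=1.0))
```

With $n = 3$ and $f = 6$ the printed values decrease as $|q|^2$ grows. Passing a flavor number with $2f > 11n$ (for instance `n_flavors=17`) reverses the trend, which shows how the sign of $11n - 2f$ controls whether the theory is asymptotically free.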
VII. RELATIVISTIC FIELD THEORY AND GAUGE THEORIES

Learning goals. After reading this chapter, the student should: 

• Be able to derive the equation of motion for a relativistic field Lagrangian $\mathcal{L}$ and interpret the terms present
in $\mathcal{L}$ physically.

• Be able to identify the Klein-Gordon, Dirac, and Proca Lagrangians. 

• Understand the concepts of local gauge invariance, spontaneous symmetry breaking, and how these are 
related to the Higgs mechanism. 


We here assume familiarity with the Lagrangian formulation of classical particle mechanics (see e.g. the free
textbook "Introduction to Lagrangian and Hamiltonian Mechanics" available to download from Bookboon), and
develop Lagrangian field theory. Then, we proceed to introduce the fundamental concepts of local gauge invariance,
spontaneous symmetry breaking, and the Higgs mechanism.


A. Lagrangians in relativistic field theory 

The first issue to address is fundamental: what is a field? When we discuss particles, we think of localized entities.
In classical physics, one is typically interested in identifying the position $\mathbf{x} = \mathbf{x}(t)$ of a particle as a function
of time $t$. On the other hand, a field occupies some region of space and time, and we are typically interested in
identifying the value of the field $\phi_i = \phi_i(x, y, z, t)$ at a given position in space and time. Temperature or electric
potential are classical examples of fields. Here, we shall use fields to describe relativistic particles.

In field theory, one starts with a Lagrangian density $\mathcal{L}$ which is a function of the fields $\phi_i$ and their derivatives

$$\partial_\mu\phi_i \equiv \frac{\partial\phi_i}{\partial x^\mu}.$$ (7.1)

The Euler-Lagrange equation reads

$$\partial_\mu\left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi_i)}\right) = \frac{\partial\mathcal{L}}{\partial\phi_i}, \qquad i = 1, 2, 3, \ldots$$ (7.2)


These equations resemble their equivalents in classical, non-relativistic mechanics, but a key difference here is that 
space and time coordinates are treated equally due to the special theory of relativity. 


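Example 19. Klein-Gordon Lagrangian. This $\mathcal{L}$ describes a scalar (spin-0) field:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) - \frac{1}{2}(mc/\hbar)^2\phi^2.$$ (7.3)

To compute the corresponding equation of motion for the field $\phi$, i.e. Eq. (7.2), we first observe that

$$\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)} = \partial^\mu\phi.$$ (7.4)

To see this, note that we can write the Lagrangian as

$$\mathcal{L} = \frac{1}{2}(\partial_0\phi\,\partial_0\phi - \partial_1\phi\,\partial_1\phi - \ldots)$$ (7.5)

from which it becomes clear that

$$\frac{\partial\mathcal{L}}{\partial(\partial_0\phi)} = \partial_0\phi = \partial^0\phi, \qquad \frac{\partial\mathcal{L}}{\partial(\partial_i\phi)} = -\partial_i\phi = \partial^i\phi.$$ (7.6)

Moreover, we note that

$$\frac{\partial\mathcal{L}}{\partial\phi} = -\left(\frac{mc}{\hbar}\right)^2\phi.$$ (7.7)

The total equation then takes the form:

$$\partial_\mu\partial^\mu\phi + (mc/\hbar)^2\phi = 0$$ (7.8)

which is precisely the Klein-Gordon equation we have discussed previously in this book.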
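A simple way to see Eq. (7.8) at work is to test it on a plane wave. The sketch below is a numerical illustration under stated assumptions: one spatial dimension, natural units $c = \hbar = 1$ so that $M \equiv mc/\hbar$, and the trial field $\phi = \cos(kx - \omega t)$. The function name `kg_residual` and the sample point are arbitrary choices made here. The residual should vanish (up to finite-difference error) only when $\omega^2 = k^2 + M^2$.

```python
import math


def kg_residual(omega, k, M, x=0.3, t=0.7, h=1e-3):
    """Finite-difference value of (d_t^2 - d_x^2 + M^2) phi at (x, t)
    for the plane wave phi = cos(k x - omega t), in units c = hbar = 1."""
    def phi(x_, t_):
        return math.cos(k * x_ - omega * t_)

    # Central second differences in time and space.
    d2t = (phi(x, t + h) - 2 * phi(x, t) + phi(x, t - h)) / h ** 2
    d2x = (phi(x + h, t) - 2 * phi(x, t) + phi(x - h, t)) / h ** 2
    return d2t - d2x + M ** 2 * phi(x, t)


if __name__ == "__main__":
    k, M = 2.0, 1.5
    on_shell = math.sqrt(k ** 2 + M ** 2)      # omega satisfying the dispersion relation
    print(kg_residual(on_shell, k, M))         # close to zero
    print(kg_residual(3.0, k, M))              # clearly non-zero
```

The on-shell frequency is the relativistic energy-momentum relation $E^2 = p^2c^2 + m^2c^4$ in these units.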


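In the same way as in the above example, one can show that the Dirac Lagrangian, which describes a
spin-1/2 field,

$$\mathcal{L} = i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi,$$ (7.9)

yields the Dirac equation via the Euler-Lagrange equation:

$$i\gamma^\mu\partial_\mu\psi - (mc/\hbar)\psi = 0.$$ (7.10)

Here, $\psi$ and $\bar{\psi}$ are treated as independent fields. Moreover, the Proca Lagrangian describes a massive vector
(spin-1) field:

$$\mathcal{L} = -\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu} + \frac{1}{8\pi}\left(\frac{mc}{\hbar}\right)^2 A^\nu A_\nu, \qquad F^{\mu\nu} \equiv \partial^\mu A^\nu - \partial^\nu A^\mu,$$ (7.11)

and leads to the Proca equation

$$\partial_\mu F^{\mu\nu} + (mc/\hbar)^2 A^\nu = 0.$$ (7.12)

In the special limit of a massless field, $m = 0$, this reduces to Maxwell's equations for empty space. A source term,
representing charge and current densities, can be added by writing

$$\mathcal{L} = -\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu} - \frac{1}{c}J^\mu A_\mu,$$ (7.13)

which provides the equation

$$\partial_\mu F^{\mu\nu} = \frac{4\pi}{c}J^\nu.$$ (7.14)

Note that it follows that $\partial_\nu J^\nu = 0$: the current inserted must satisfy the continuity equation.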
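The continuity statement can be made explicit in one line. The following worked step is a sketch that assumes only the antisymmetry $F^{\mu\nu} = -F^{\nu\mu}$ of the field tensor and the fact that partial derivatives commute:

```latex
\begin{align*}
\partial_\nu\partial_\mu F^{\mu\nu}
  &= -\,\partial_\nu\partial_\mu F^{\nu\mu}
     && \text{(antisymmetry of } F^{\mu\nu}\text{)} \\
  &= -\,\partial_\mu\partial_\nu F^{\mu\nu}
     && \text{(relabel } \mu \leftrightarrow \nu\text{)} \\
  &= -\,\partial_\nu\partial_\mu F^{\mu\nu}
     && \text{(partial derivatives commute)} \\
\Rightarrow\quad \partial_\nu\partial_\mu F^{\mu\nu} &= 0
  \quad\Rightarrow\quad
  \frac{4\pi}{c}\,\partial_\nu J^\nu = 0 .
\end{align*}
```

Applying $\partial_\nu$ to both sides of the field equation with a source therefore forces $\partial_\nu J^\nu = 0$, independently of the details of the current.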

Now, where do all the above Lagrangians come from? We have simply written them down without any justification.
In fact, they were all chosen so that they would provide the correct field equations for their corresponding fields.
Therefore, in contrast to classical, non-relativistic mechanics, where one derives $L = T - U$ with $T$ the kinetic energy and $U$ the
potential energy, $\mathcal{L}$ is axiomatic in relativistic field theory. As for classical, non-relativistic Lagrangians, $\mathcal{L}$
is not unique. We can add a constant or a divergence without affecting the equation of motion, Eq. (7.2), for the field.

B. Local gauge invariance

The Dirac Lagrangian is invariant under the transformation $\psi \to e^{i\theta}\psi$. This is a global transformation as $\theta$ is
independent of position and time. Moreover, $\theta$ is a real number here. Therefore, since $\bar{\psi} \to e^{-i\theta}\bar{\psi}$, all exponential
factors cancel due to the fact that it is the combination $\bar{\psi}\psi$ that appears in $\mathcal{L}_{\rm Dirac}$. When $\theta = \theta(x)$, the above is a
local gauge transformation. The phase factor is usually thought of as a mere convention: as known from quantum
mechanics, the phase of a wavefunction is unobservable, and only the difference in phases between wavefunctions
may be observed. A local gauge symmetry just means that we should be able to freely change the convention
(reference point) of the phase at any point in space-time without changing the physics, and not only change the
reference point globally (i.e. by equally much at each point in space-time). However, there seems to be a problem:
$\mathcal{L}_{\rm Dirac}$ is not invariant under a local transformation $\psi \to e^{i\theta(x)}\psi$. Instead, it transforms as $\mathcal{L} \to \mathcal{L} - \hbar c(\partial_\mu\theta)\bar{\psi}\gamma^\mu\psi$.
We define $\lambda(x) = -(\hbar c/q)\theta(x)$, so that when $\psi \to e^{-iq\lambda(x)/\hbar c}\psi$ we obtain

$$\mathcal{L} \to \mathcal{L} + (q\bar{\psi}\gamma^\mu\psi)\partial_\mu\lambda.$$ (7.15)

We see that there is a way to regain local gauge invariance, namely by adding a term to $\mathcal{L}$ which cancels
the extra contribution obtained during the local gauge transformation. Assume that

$$\mathcal{L} = [i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi] - (q\bar{\psi}\gamma^\mu\psi)A_\mu,$$ (7.16)

where $A_\mu$ is the so-called gauge field. If the fields now transform like $\psi \to e^{-iq\lambda(x)/\hbar c}\psi$ and $A_\mu \to A_\mu + \partial_\mu\lambda$,
then $\mathcal{L}$ is invariant under such a local gauge transformation. Whereas this mathematically restores the gauge
invariance, we still have to physically justify this procedure. This can be done by realizing that the term $q\bar{\psi}\gamma^\mu\psi A_\mu$
describes the coupling between the vector field $A_\mu$ and the fermion field $\psi$. If $A_\mu$ is present, however, there should
in all fairness also exist a "free" term in $\mathcal{L}$ that describes $A_\mu$ irrespective of whether or not a fermion field $\psi$ is
present that it can couple to. This can be accomplished via the Proca Lagrangian with the mass term set to zero:

$$\mathcal{L} = -\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu}$$ (7.17)

since $F^{\mu\nu}$ is invariant under the gauge transformation in question (whereas a mass term $\propto A^\nu A_\nu$ is not). As a
result, we arrive at the important conclusion that:


When imposing local gauge invariance on the Dirac $\mathcal{L}$, we must introduce a massless vector field $A^\mu$.

The full Lagrangian describing a spin-1/2 fermion field coupled to a massless spin-1 boson field then reads:

$$\mathcal{L} = [i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi] + \left[-\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu}\right] - (q\bar{\psi}\gamma^\mu\psi)A_\mu.$$

We identify $A_\mu$ as the electromagnetic vector potential since:

1. $A_\mu \to A_\mu + \partial_\mu\lambda$ leaves $F^{\mu\nu}$ invariant.

2. The two last terms in the above $\mathcal{L}$ give the Maxwell Lagrangian with a source term $J^\mu = cq(\bar{\psi}\gamma^\mu\psi)$, and
this is precisely the current produced by Dirac particles.


We have therefore identified a Lagrangian which describes electrodynamics through the coupling between Dirac
fermions and photons, in particular the current $J^\mu$ produced by the former. Remarkably, the origin of this coupling
was that we demanded local gauge invariance.
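It may help to see the cancellation term by term. The following worked step is a sketch using the transformation rules $\psi \to e^{-iq\lambda(x)/\hbar c}\psi$, $\bar{\psi} \to e^{+iq\lambda(x)/\hbar c}\bar{\psi}$ and $A_\mu \to A_\mu + \partial_\mu\lambda$ together with the definition $F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu$:

```latex
\begin{align*}
i\hbar c\,\bar\psi\gamma^\mu\partial_\mu\psi
  &\;\to\; i\hbar c\,\bar\psi\, e^{+iq\lambda/\hbar c}\,\gamma^\mu\,
           \partial_\mu\!\left(e^{-iq\lambda/\hbar c}\psi\right)
   = i\hbar c\,\bar\psi\gamma^\mu\partial_\mu\psi
     + (q\bar\psi\gamma^\mu\psi)\,\partial_\mu\lambda, \\
-(q\bar\psi\gamma^\mu\psi)A_\mu
  &\;\to\; -(q\bar\psi\gamma^\mu\psi)A_\mu
     - (q\bar\psi\gamma^\mu\psi)\,\partial_\mu\lambda, \\
mc^2\,\bar\psi\psi &\;\to\; mc^2\,\bar\psi\psi, \\
F^{\mu\nu} &\;\to\; F^{\mu\nu}
     + (\partial^\mu\partial^\nu - \partial^\nu\partial^\mu)\lambda
   = F^{\mu\nu}.
\end{align*}
```

The two terms proportional to $\partial_\mu\lambda$ cancel in the sum, so the full Lagrangian is unchanged, whereas a mass term $\propto A^\nu A_\nu$ would pick up uncancelled pieces linear and quadratic in $\partial_\nu\lambda$.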


The main difference, mathematically, between global and local gauge transformations arises due to derivatives of
the field. Let

$$\partial_\mu \to D_\mu \equiv \partial_\mu + i\frac{q}{\hbar c}A_\mu$$ (7.18)

to convert a global gauge invariance to a local one. The above relation is known as minimal coupling. Note that
the gauge field here must be massless. In terms of symmetry, we have a local U(1) gauge symmetry since $e^{i\theta}$
belongs to U(1). The mathematical structure here may be extended to a Lagrangian with two spin-1/2 fields $\psi_1$
and $\psi_2$, in which case we obtain a local SU(2) gauge symmetry. This is an example of the so-called Yang-Mills
theories which we shall treat in more detail later.

Before moving on to discussing how various terms in $\mathcal{L}$ can be interpreted physically in general, let us formally
define precisely what is meant by a gauge theory:


A gauge theory is a field theory described by a Lagrangian $\mathcal{L}$ that is invariant under a group of local continuous
transformations. The mathematical procedure which adjusts the degrees of freedom related to the invariance
of $\mathcal{L}$ is referred to as the gauge.


C. Interpreting $\mathcal{L}$: importance of the mass term

In general, $\mathcal{L}$ consists of two kinds of terms.

• Free Lagrangian for each field ($\mathcal{L}_0$).

• Interaction terms for the fields ($\mathcal{L}_{\rm int}$).

$\mathcal{L}_{\rm int}$ may be obtained by invoking local gauge invariance, as we showed in the above treatment of the Dirac and
Proca Lagrangians. Local gauge invariance thus provides a way to identify the vertex couplings in a theory, and
it works beautifully for strong and electromagnetic interactions. However, in the weak interactions the gauge
field is far from massless. It is nevertheless still possible to create a gauge theory with massive gauge fields via
spontaneous symmetry breaking and the Higgs mechanism. Before exploring these phenomena, we consider how
to identify a mass term in $\mathcal{L}$ in the first place.


Example 20. Identifying the mass term. What is the mass term in the following Lagrangian?

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) + e^{-(\alpha\phi)^2}.$$ (7.19)

Let us expand $\mathcal{L}$ around the point $\phi = 0$:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) + 1 - \alpha^2\phi^2 + \frac{1}{2}\alpha^4\phi^4 - \ldots$$ (7.20)

Since constants do not affect the equations of motion for the field (although constants do contribute to the so-called
cosmological constant), we discard them and thus identify the mass term by comparing the $\mathcal{O}(\phi^2)$-term with the
Klein-Gordon mass term $-\frac{1}{2}(mc/\hbar)^2\phi^2$ of Eq. (7.3):

$$m = \frac{\sqrt{2}\,\alpha\hbar}{c}.$$ (7.21)


The higher-order terms [$\mathcal{O}(\phi^4)$ and higher] represent couplings, i.e. vertices at which four or more $\phi$ lines meet.

However, we cannot always simply extract the mass from the term which is second order in the field. An exception
is for instance:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) + \frac{1}{2}\mu^2\phi^2 - \frac{1}{4}\lambda^2\phi^4.$$ (7.22)

The sign is the problem here: it looks like the mass should be imaginary, since the sign of the $\phi^2$ term is opposite to that in the previous
examples we have encountered.


To resolve this problem, we have to realize that Feynman calculus using field Lagrangians is really a perturbation
theory starting from the ground-state. The higher-order terms ($\phi^4$, $\phi^6$, ...) then represent higher-order corrections
to the ground-state. But this is because, so far, the field configuration that gives the minimum energy has been
$\phi = 0$. In Eq. (7.22), this is no longer the case. We thus have to rewrite $\mathcal{L}$ in terms of deviations from the ground-state.
To do so, we first have to identify the value of $\phi$ which gives the ground-state. We can do so by conjecturing
that $\mathcal{L} = T - U$ (just like in classical, non-relativistic Lagrangian theory), where

$$T = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi)$$ (7.23)

is the kinetic energy while

$$U = -\frac{1}{2}\mu^2\phi^2 + \frac{1}{4}\lambda^2\phi^4$$ (7.24)

is the potential energy. We then see that the minimum of $U(\phi)$ occurs at $\phi = \phi_m = \pm\mu/\lambda$. We introduce a new
field $\eta$ as the deviation from the minimum value:

$$\eta = \phi \pm \mu/\lambda$$ (7.25)

and then express $\mathcal{L}$ in terms of the new field $\eta$:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\eta)(\partial^\mu\eta) - \mu^2\eta^2 \pm \mu\lambda\eta^3 - \frac{1}{4}\lambda^2\eta^4 + \frac{1}{4}(\mu^2/\lambda)^2.$$ (7.26)

Now, we can identify the mass term since the second order term in $\eta$ has the appropriate sign:

$$m = \frac{\sqrt{2}\,\mu\hbar}{c}$$ (7.27)

and the other terms in $\mathcal{L}$ are couplings, namely cubic and quartic self-interactions of $\eta$.


Note how the $\mathcal{L}$'s expressed with $\phi$ and $\eta$ represent exactly the same physical system - we have only changed
the notation. However, the key insight here is that the $\phi$-version is not suitable for Feynman calculus since a
perturbation series in $\phi$ would not converge (as $\phi = 0$ is not the ground-state).

The conclusion is then that in order to identify the mass term, we must do the following:

1. Locate the ground-state.

2. Express $\mathcal{L}$ as a function of the deviation $\eta$ from the ground-state.

3. Expand $\mathcal{L}$ in powers of $\eta$: the mass term comes from the $\eta^2$ term.
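The three steps above can be sketched numerically for the potential of Eq. (7.24). This is only an illustration under stated assumptions: the parameter values are arbitrary, the minimum is located by a plain gradient descent (one possible choice among many), and the function names are hypothetical. The coefficient of $\eta^2$ in $U$ is $\frac{1}{2}U''(\phi_m)$, and comparing with the Klein-Gordon form $\frac{1}{2}(mc/\hbar)^2\eta^2$ gives $mc/\hbar = \sqrt{U''(\phi_m)}$.

```python
import math


def potential(phi, mu, lam):
    """U(phi) = -(1/2) mu^2 phi^2 + (1/4) lam^2 phi^4, as in Eq. (7.24)."""
    return -0.5 * mu ** 2 * phi ** 2 + 0.25 * lam ** 2 * phi ** 4


def find_minimum(mu, lam, phi_start=0.1, step=0.01, iters=20000, h=1e-5):
    """Step 1: locate the ground-state by gradient descent from phi_start > 0."""
    phi = phi_start
    for _ in range(iters):
        grad = (potential(phi + h, mu, lam) - potential(phi - h, mu, lam)) / (2 * h)
        phi -= step * grad
    return phi


def mass_parameter(mu, lam, h=1e-4):
    """Steps 2-3: expand around the minimum and read off mc/hbar = sqrt(U'')."""
    phi_m = find_minimum(mu, lam)
    curvature = (potential(phi_m + h, mu, lam) - 2 * potential(phi_m, mu, lam)
                 + potential(phi_m - h, mu, lam)) / h ** 2
    return phi_m, math.sqrt(curvature)


if __name__ == "__main__":
    mu, lam = 1.2, 0.8  # arbitrary illustrative values
    phi_m, mc_over_hbar = mass_parameter(mu, lam)
    print(phi_m, mu / lam)                    # ground-state vs. mu/lambda
    print(mc_over_hbar, math.sqrt(2) * mu)    # mass parameter vs. sqrt(2) mu
```

Expanding around $\phi = 0$ instead would give a negative curvature $-\mu^2$, which is the "imaginary mass" problem described above.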

What about linear terms in $\mathcal{L}$?

Assume that we have a Lagrangian of the form:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) - a\phi^2 - b\phi^3 - c\phi.$$ (7.28)

Introduce the field $\eta = \phi - \phi_0$ and write

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\eta)(\partial^\mu\eta) - A\eta^2 - B\eta^3.$$ (7.29)

Now identify $\{A, B, \phi_0\}$ so that, up to an irrelevant constant,

$$a\phi^2 + b\phi^3 + c\phi = A\eta^2 + B\eta^3.$$ (7.30)

This gives us three equations: one for each order of $\phi$. Therefore, a linear term can simply be absorbed into a
new field ($\eta$ above) which describes the deviation from some constant value of the original field ($\phi$ in the above
example). However, note that $\eta = 0$ is not necessarily the ground-state of the field: in fact, in general it will not
be. So even though we can drop the linear term without loss of generality when considering a generic form of $\mathcal{L}$,
we must pay attention to how it modifies the constants for a specific theory so that we may correctly identify the
ground-state around which the field is expanded (e.g. in the potential energy).

D. Spontaneous symmetry breaking


This phenomenon refers to a situation where the Lagrangian (or the equations of motion) has a given symmetry,
whereas the ground-state does not share the same symmetry. Let us borrow an example from condensed matter
theory, namely a ferromagnetic material. In the simplest model, the Lagrangian (or Hamiltonian, if you prefer)
only depends on the magnitude of the magnetization $M = |\mathbf{M}|$. Therefore, it does not matter energetically in which
direction the magnetization points in such a material: all states are equivalent. However, as soon as a material
becomes magnetic, the magnetization has to point in some direction. In other words, the ground-state has chosen
one particular solution out of all the available ones that have equal energy. In this way, the ground-state has
lowered the symmetry that the original Lagrangian had. The word "breaking" could therefore in some sense be
substituted by "choosing": spontaneous symmetry breaking occurs when choosing one particular ground-state out
of many available.


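Example 21. Mexican hat potential. Consider the Lagrangian for a two-component field (with components $\phi_1$
and $\phi_2$):

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi_1)(\partial^\mu\phi_1) + \frac{1}{2}(\partial_\mu\phi_2)(\partial^\mu\phi_2) + \frac{1}{2}\mu^2(\phi_1^2 + \phi_2^2) - \frac{1}{4}\lambda^2(\phi_1^2 + \phi_2^2)^2.$$ (7.31)

This is invariant under rotations in $(\phi_1, \phi_2)$-space. This means that we have an SO(2) symmetry: $\mathcal{L}$ is invariant
under the transformation

$$\begin{pmatrix} \phi_1 \\ \phi_2 \end{pmatrix} \to \begin{pmatrix} \phi_1' \\ \phi_2' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} \phi_1 \\ \phi_2 \end{pmatrix}.$$ (7.32)

We recognize the rotation matrix in 2D, which belongs to the SO(2) group. The minimum of the potential in $\mathcal{L}$ is
given by the equation

$$\phi_{m,1}^2 + \phi_{m,2}^2 = \mu^2/\lambda^2.$$ (7.33)

Thus, we in fact have an entire circle of minima. In order for us to be allowed to use Feynman calculus, we must
expand around the ground-state. This means that we must choose one particular ground-state on this circle. Note
that the ground-state is often referred to simply as the vacuum for a given model. The physics should not depend on
which ground-state we choose, so we might as well take

$$\phi_{m,1} = \mu/\lambda, \qquad \phi_{m,2} = 0.$$ (7.34)

The fluctuation fields are then $\eta$ and $\xi$ where:

$$\eta = \phi_1 - \mu/\lambda, \qquad \xi = \phi_2.$$ (7.35)

We rewrite $\mathcal{L}$ in terms of these to obtain:

$$\mathcal{L} = \left[\frac{1}{2}(\partial_\mu\eta)(\partial^\mu\eta) - \mu^2\eta^2\right] + \left[\frac{1}{2}(\partial_\mu\xi)(\partial^\mu\xi)\right] - \left[\mu\lambda(\eta^3 + \eta\xi^2) + \frac{1}{4}\lambda^2(\eta^4 + \xi^4 + 2\eta^2\xi^2)\right] + \frac{1}{4}(\mu^2/\lambda)^2.$$ (7.36)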

The terms can now be interpreted as follows.

• 1st term: free K-G Lagrangian with mass $m_\eta = \sqrt{2}\mu\hbar/c$.

• 2nd term: free K-G Lagrangian with mass $m_\xi = 0$.

• 3rd term: five coupling terms (cubic and quartic vertices involving $\eta$ and $\xi$).

• 4th term: irrelevant constant.

The Lagrangian is no longer manifestly symmetric in the fields since we have chosen to express it in terms of one
particular solution. One of the new fields, $\xi$, is actually massless.
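The appearance of one massive and one massless mode can also be seen directly from the curvature of the potential at any point on the circle of minima. The sketch below is illustrative only: it uses the potential part of Eq. (7.31), $U = -\frac{1}{2}\mu^2(\phi_1^2+\phi_2^2) + \frac{1}{4}\lambda^2(\phi_1^2+\phi_2^2)^2$, arbitrary parameter values, finite differences for the second derivatives, and hypothetical function names. The eigenvalues of the matrix of second derivatives play the role of $(mc/\hbar)^2$ for the two fluctuation fields.

```python
import math


def potential(p1, p2, mu, lam):
    """Mexican hat potential U(phi_1, phi_2) taken from Eq. (7.31)."""
    r2 = p1 * p1 + p2 * p2
    return -0.5 * mu ** 2 * r2 + 0.25 * lam ** 2 * r2 ** 2


def hessian(p1, p2, mu, lam, h=1e-4):
    """Second derivatives (U_11, U_12, U_22) by central finite differences."""
    def U(a, b):
        return potential(a, b, mu, lam)

    u11 = (U(p1 + h, p2) - 2 * U(p1, p2) + U(p1 - h, p2)) / h ** 2
    u22 = (U(p1, p2 + h) - 2 * U(p1, p2) + U(p1, p2 - h)) / h ** 2
    u12 = (U(p1 + h, p2 + h) - U(p1 + h, p2 - h)
           - U(p1 - h, p2 + h) + U(p1 - h, p2 - h)) / (4 * h ** 2)
    return u11, u12, u22


def curvature_eigenvalues(angle, mu, lam):
    """Eigenvalues (ascending) of the Hessian at the vacuum
    (mu/lam) * (cos(angle), sin(angle)) on the circle of minima."""
    v = mu / lam
    u11, u12, u22 = hessian(v * math.cos(angle), v * math.sin(angle), mu, lam)
    mean = 0.5 * (u11 + u22)
    diff = math.hypot(0.5 * (u11 - u22), u12)
    return mean - diff, mean + diff


if __name__ == "__main__":
    mu, lam = 1.2, 0.8  # arbitrary illustrative values
    for angle in (0.0, 1.0, 2.5):
        print(angle, curvature_eigenvalues(angle, mu, lam))
```

For every choice of vacuum on the circle, the printed pair should be close to $(0, 2\mu^2)$: a flat direction along the circle (the massless $\xi$) and a curved radial direction (the massive $\eta$).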


The above is an example of a more fundamental theorem, namely:

Goldstone's theorem: Spontaneous breaking of a continuous global symmetry always generates one or more
massless scalar (spin-0) particles, called Goldstone bosons.

This does not seem very helpful for our purpose - we were looking for a way to generate massive gauge fields, 
not massless ones. As we shall now see, the gauge fields become massive when we apply spontaneous symmetry 
breaking to a local gauge invariance rather than global gauge invariance. 


E. Higgs mechanism 


Let us first make the notation a bit more convenient:

$$\phi \equiv \phi_1 + i\phi_2 \quad\Rightarrow\quad \phi^*\phi = \phi_1^2 + \phi_2^2.$$ (7.37)

Then, the Lagrangian we have considered takes the form:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu\phi)^*(\partial^\mu\phi) + \frac{1}{2}\mu^2(\phi^*\phi) - \frac{1}{4}\lambda^2(\phi^*\phi)^2.$$ (7.38)

This $\mathcal{L}$ is invariant under U(1) phase transformations $\phi \to e^{i\theta}\phi$. Note that there is no contradiction between this
result and our previous statement that the Lagrangian had an SO(2) symmetry. The reason for this is that the
groups SO(2) and U(1) are isomorphic (see our previous treatment of symmetry groups), so we have not changed
anything but notation.


We can now make $\mathcal{L}$ invariant under local U(1) gauge transformations by introducing a massless gauge field $A^\mu$ and a minimal coupling

$$\partial_\mu \to D_\mu = \partial_\mu + i\frac{q}{\hbar c}A_\mu.$$ (7.39)

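Then, the following $\mathcal{L}$ has a local gauge symmetry:

$$\mathcal{L} = \frac{1}{2}\left[\left(\partial_\mu - i\frac{q}{\hbar c}A_\mu\right)\phi^*\right]\left[\left(\partial^\mu + i\frac{q}{\hbar c}A^\mu\right)\phi\right] + \frac{1}{2}\mu^2(\phi^*\phi) - \frac{1}{4}\lambda^2(\phi^*\phi)^2 - \frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu}.$$ (7.40)

If we again define the fluctuation fields $\eta = \phi_1 - \mu/\lambda$ and $\xi = \phi_2$, we obtain

$$\mathcal{L} = \left[\frac{1}{2}(\partial_\mu\eta)(\partial^\mu\eta) - \mu^2\eta^2\right] + \left[\frac{1}{2}(\partial_\mu\xi)(\partial^\mu\xi)\right] + \left[-\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu} + \frac{1}{2}\left(\frac{q\mu}{\hbar c\lambda}\right)^2 A_\mu A^\mu\right] + \text{interaction terms}.$$ (7.41)

The gauge field $A^\mu$ is now massive: $m_A = 2\sqrt{\pi}\,(q\mu/\lambda c^2)$. However, we still have the massless Goldstone boson $\xi$
present. Moreover, there are strange interaction terms $\propto (\partial_\mu\xi)A^\mu$ suggesting a conversion between $\xi$ and $A^\mu$. We
see that both of these problems are related to the $\xi = \phi_2$ field. Interestingly, we can remove this field completely
by exploiting local gauge invariance.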

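To see this, let

$$\phi \to \phi' = (\cos\theta + i\sin\theta)(\phi_1 + i\phi_2) = (\phi_1\cos\theta - \phi_2\sin\theta) + i(\phi_1\sin\theta + \phi_2\cos\theta)$$ (7.42)

so that if we choose $\theta = -\arctan(\phi_2/\phi_1)$, then $\phi_2' = 0$. Since $\mathcal{L}$ has the same form expressed with $(\phi', A'_\mu)$ as it does
with $(\phi, A_\mu)$ (that is what gauge invariance means by definition), we have now obtained $\xi = 0$ in this particular
gauge and $\mathcal{L}$ takes the form (where it is assumed that $A_\mu$ has been gauge-transformed accordingly):

$$\mathcal{L} = \left[\frac{1}{2}(\partial_\mu\eta)(\partial^\mu\eta) - \mu^2\eta^2\right] + \left[-\frac{1}{16\pi}F^{\mu\nu}F_{\mu\nu} + \frac{1}{2}\left(\frac{q\mu}{\hbar c\lambda}\right)^2 A_\mu A^\mu\right]$$
$$+ \left\{\frac{\mu}{\lambda}\left(\frac{q}{\hbar c}\right)^2\eta\,(A_\mu A^\mu) + \frac{1}{2}\left(\frac{q}{\hbar c}\right)^2\eta^2(A_\mu A^\mu) - \lambda\mu\eta^3 - \frac{1}{4}\lambda^2\eta^4\right\} + \left(\frac{\mu^2}{2\lambda}\right)^2.$$ (7.43)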


Again, the $\mathcal{L}$'s in these two gauges describe exactly the same physics, but the new choice of gauge gives us an $\mathcal{L}$
[Eq. (7.43)] that is easier to interpret. We are left with a single massive scalar (the Higgs particle) and a massive
gauge field $A^\mu$. Physicists like to say that $A^\mu$ "ate" the Goldstone boson $\xi$ and thus acquired mass. This can also
be thought of as $A^\mu$ acquiring a longitudinal polarization (since it is now massive), which is a new degree of freedom.

The Higgs mechanism has thus allowed the gauge boson $A^\mu$ to become massive, and this is what happens in
the Standard Model where a spontaneous symmetry breaking of the Higgs field gives mass to $W^\pm$ and $Z^0$, while
leaving the photon massless. The spontaneous symmetry breaking of the Higgs field gives mass not only to the
gauge fields, but also to the fermion fields. The latter couple to the Higgs field via a so-called Yukawa coupling.
This is necessary in order for the fermion fields to acquire mass, because in the electroweak Lagrangian there can
be no bare mass terms for the fermions of the type $m\bar{\psi}\psi = m(\bar{\psi}_L\psi_R + \bar{\psi}_R\psi_L)$ since these would explicitly
break the SU(2)$_L$ symmetry. The spontaneous symmetry breaking of the Higgs field reduces the electroweak
symmetry group SU(2)$_L \times$ U(1)$_Y$ to a U(1)$_{\rm em}$ symmetry (which can be seen by writing out the new Lagrangian
after the symmetry breaking), so that one is left with only one massless gauge field (the photon) in agreement with
experiments. In effect, there is good reason to believe that fundamental interactions (such as weak, strong, and
electromagnetic interactions) can be described by local gauge theories.

Note that "symmetry breaking" is often used in the literature without specifying if one refers to explicit or
spontaneous symmetry breaking - it is supposed to be clear from the context. This might not always be so and for
clarity we state what the difference is here.

Explicit symmetry breaking: A Lagrangian (or Hamiltonian) contains terms that exclude a certain symmetry
operation. For instance, a magnetic field applied in the $z$-direction breaks spin rotation symmetry since spins
pointing in the $z$ direction now have a different energy than e.g. spins pointing along the $x$ direction.

Spontaneous symmetry breaking: A Lagrangian (or Hamiltonian) is invariant under a certain symmetry operation,
but the ground-state in which the same system resides does not share that symmetry. For instance, in the
absence of a magnetic field the energy of a ferromagnetic material is independent of the direction of the magnetization.
However, once the material turns ferromagnetic the magnetization points in a given direction and thus has
broken the original symmetry by selecting one particular realization of the ground-state.


F. Yang-Mills theory 

Instead of looking at the details of the full electroweak Lagrangian, we give an example of a simpler SU(2)-invariant
Lagrangian and its associated gauge fields. This will allow us to illustrate physics similar to that of the
electroweak theory, but in a technically more transparent way. In particular, we will recover the coupling structure
used in GWS theory ($\mathbf{j}_\mu\cdot\mathbf{W}^\mu$) and see that three gauge fields are required (unlike the single gauge field required
in QED). Yang-Mills theory is also known as non-Abelian gauge theory because its Lagrangian has an SU(2)
symmetry. In turn, SU(2) is a non-Abelian group since its matrices do not in general commute.

Suppose that we have two spin-1/2 fields, $\psi_1$ and $\psi_2$. This is just like in the weak isospin doublet case,
$\chi_L = (\nu_e, e)_L^T$. The free Lagrangian takes the form

$$\mathcal{L} = [i\hbar c\,\bar{\psi}_1\gamma^\mu\partial_\mu\psi_1 - m_1c^2\,\bar{\psi}_1\psi_1] + (1 \to 2).$$ (7.44)

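Consider for simplicity the case of equal masses, so that we can make the notation more compact upon introducing

$$\psi = \begin{pmatrix} \psi_1 \\ \psi_2 \end{pmatrix},$$ (7.45)

which leads to:

$$\mathcal{L} = i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi.$$ (7.46)

This $\mathcal{L}$ is invariant under the transformation $\psi \to U\psi$ where $U$ is any $2\times 2$ unitary matrix. The general form of an
SU(2) matrix is

$$U = e^{i\boldsymbol{\tau}\cdot\mathbf{a}}$$ (7.47)

where $\boldsymbol{\tau}$ is the Pauli matrix vector. Therefore, $\mathcal{L}$ is invariant under global SU(2) transformations. If we now insist
on the SU(2) symmetry being local, we have to introduce gauge fields as before. For a local transformation, we let
$\mathbf{a} = \mathbf{a}(x^\mu)$. Let us write this as:

$$\psi \to S\psi, \qquad S \equiv e^{-iq\boldsymbol{\tau}\cdot\boldsymbol{\lambda}(x)/\hbar c}$$ (7.48)

which is a local SU(2) transformation. The original $\mathcal{L}$ is not invariant under this transformation, but we have
already laid out a strategy for fixing this: we replace $\partial_\mu$ with the covariant derivative $D_\mu$:

$$D_\mu \equiv \partial_\mu + i\frac{q}{\hbar c}\boldsymbol{\tau}\cdot\mathbf{A}_\mu.$$ (7.49)

The remaining task is to identify a transformation rule for the gauge fields $\mathbf{A}_\mu$ so that

$$D_\mu\psi \to S(D_\mu\psi),$$ (7.50)

in which case we have an $\mathcal{L}$ that is invariant under local SU(2) transformations, as can be verified by direct insertion
of the above transformation. It turns out that (try to show this!) the following transformation rule for $\mathbf{A}_\mu$ does the
trick:

$$\boldsymbol{\tau}\cdot\mathbf{A}'_\mu = S(\boldsymbol{\tau}\cdot\mathbf{A}_\mu)S^{-1} + i(\hbar c/q)(\partial_\mu S)S^{-1}.$$ (7.51)

It is particularly instructive to consider the special case of infinitesimal transformations where $|\boldsymbol{\lambda}|$ is small. In that
case, an expansion of $S$ yields:

$$S \simeq 1 - \frac{iq}{\hbar c}\boldsymbol{\tau}\cdot\boldsymbol{\lambda}, \qquad S^{-1} \simeq 1 + \frac{iq}{\hbar c}\boldsymbol{\tau}\cdot\boldsymbol{\lambda}, \qquad \partial_\mu S \simeq -\frac{iq}{\hbar c}\boldsymbol{\tau}\cdot\partial_\mu\boldsymbol{\lambda}.$$ (7.52)

Inserted into Eq. (7.51), we obtain the transformation rule for the vector of gauge fields:

$$\mathbf{A}'_\mu \simeq \mathbf{A}_\mu + \partial_\mu\boldsymbol{\lambda} + \frac{2q}{\hbar c}(\boldsymbol{\lambda}\times\mathbf{A}_\mu).$$ (7.53)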
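The cross-product term in Eq. (7.53) has no counterpart in QED, and it can be traced back to the fact that the Pauli matrices do not commute. A minimal numerical sketch of the underlying identity $[\boldsymbol{\tau}\cdot\boldsymbol{\lambda}, \boldsymbol{\tau}\cdot\mathbf{A}] = 2i\,\boldsymbol{\tau}\cdot(\boldsymbol{\lambda}\times\mathbf{A})$ is given below. The sample vectors are arbitrary, the components of $\mathbf{A}$ are taken at one fixed Lorentz index, and all function names are hypothetical.

```python
# Pauli matrices tau_1, tau_2, tau_3 as nested lists (rows).
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def tau_dot(v):
    """The 2x2 matrix tau . v for a real 3-vector v."""
    return [[sum(v[k] * PAULI[k][i][j] for k in range(3)) for j in range(2)]
            for i in range(2)]


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def commutator(a, b):
    """[a, b] = ab - ba for 2x2 matrices."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]


def cross(u, v):
    """Ordinary cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def max_deviation(lam, A):
    """Largest entry of [tau.lam, tau.A] - 2i tau.(lam x A)."""
    lhs = commutator(tau_dot(lam), tau_dot(A))
    rhs = tau_dot(cross(lam, A))
    return max(abs(lhs[i][j] - 2j * rhs[i][j])
               for i in range(2) for j in range(2))


if __name__ == "__main__":
    lam = (0.3, -0.5, 0.2)   # arbitrary sample gauge parameters
    A = (1.0, 0.4, -0.7)     # arbitrary sample gauge-field components
    print(max_deviation(lam, A))  # numerically zero
```

For an Abelian group such as U(1) the corresponding commutator vanishes identically, which is why only the $\partial_\mu\lambda$ term appears in the QED transformation rule.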

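With these transformation rules, the following Lagrangian:

$$\mathcal{L} = i\hbar c\,\bar{\psi}\gamma^\mu D_\mu\psi - mc^2\,\bar{\psi}\psi$$
$$= [i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi] - (q\bar{\psi}\gamma^\mu\boldsymbol{\tau}\psi)\cdot\mathbf{A}_\mu$$ (7.54)

is invariant under local SU(2) transformations. We should also include the free Lagrangian of the three new vector
fields:

$$\mathcal{L} = -\frac{1}{16\pi}\sum_{j=1}^{3}F_j^{\mu\nu}F_{j\,\mu\nu} \equiv -\frac{1}{16\pi}\mathbf{F}^{\mu\nu}\cdot\mathbf{F}_{\mu\nu}.$$ (7.55)

These must be massless gauge fields, since a mass term $\propto m^2\,\mathbf{A}^\nu\cdot\mathbf{A}_\nu$ would break SU(2) invariance. Also, we must
revise the structure of $\mathbf{F}^{\mu\nu}$ compared to our previously used $F^{\mu\nu}$ since the $\mathbf{A}$ field now has an extra term in the
transformation rule Eq. (7.53):

$$\mathbf{F}^{\mu\nu} = \partial^\mu\mathbf{A}^\nu - \partial^\nu\mathbf{A}^\mu - \frac{2q}{\hbar c}(\mathbf{A}^\mu\times\mathbf{A}^\nu).$$ (7.56)

In conclusion, the full Yang-Mills Lagrangian describing two equal-mass Dirac fields interacting with three massless
gauge fields is:

$$\mathcal{L} = [i\hbar c\,\bar{\psi}\gamma^\mu\partial_\mu\psi - mc^2\,\bar{\psi}\psi] - (q\bar{\psi}\gamma^\mu\boldsymbol{\tau}\psi)\cdot\mathbf{A}_\mu - \frac{1}{16\pi}\mathbf{F}^{\mu\nu}\cdot\mathbf{F}_{\mu\nu}.$$ (7.57)

Notice the form of the coupling, $(q\bar{\psi}\gamma^\mu\boldsymbol{\tau}\psi)\cdot\mathbf{A}_\mu$, exactly like in electroweak theory.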